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TRANSCRIPTION FACTOR MODULATING COMPOUNDS AND METHODS 

OF USE THEREOF 

Related Applications 

5 This application claims priority to U.S. Provisional Patent Application 

No. 60/425,916, filed November 13, 2002; and U.S. Provisional Patent Application No. 

60/423,319, filed November 1, 2002. This application is a continuation-in-part of U.S. 

Application No. 10/139,591, filed on May 6, 2002, which claims priority to U.S. 

Provisional Patent Application Serial No. 60/288,660, entitled "Helix-Turn-Helix 
10 Protein Modulating Compounds and Methods of Use Thereof," filed on May 4, 2001 . 

The entire contents of each of the aforementioned applications are hereby incorporated 

herein by reference. 

Background of the Invention 

15 Most antibiotics currently used and in development to treat bacterial 

infections impose selective pressure on microorganisms and have led to the development 
of widespread antibiotic resistance. Therefore, the development of an alternative 
approach to treating microbial infections would be of great benefit. 

Multidrug resistance in bacteria is generally attributed to the acquisition 

20 of multiple transposons and plasmids bearing genetic determinants for different 

mechanisms of resistance (Gold et al 1996. N. Engl. J. Med. 335:1445). However, 
descriptions of intrinsic mechanisms that confer multidrug resistance have begun to 
emerge. The first of these was a chromosomally encoded multiple antibiotic resistance 
(mar) locus in Escherichia coli (George and Levy, 1983. J. Bacterid. 155:531; George 

25 and Levy 1983 J. Bacteriol. 155:541). Mar mutants of E. coli arose at a frequency of 
10" 6 to 10" 7 and. were selected by growth on subinhibitory levels of tetracycline or 
chloramphenicol (George and Levy, supra). These mutants exhibited resistance to 
tetracyclines, chloramphenicol, penicillins, cephalosporins, puromycin, nalidixic acid, 
and rifampin (George and Levy, supra). Later, the resistance phenotype was extended to 

30 include fluoroquinolones (Cohen et al. 1989. Antimicrob. Agents Chemother. 33:1318), 
oxidative stress agents ( Ariza et al. 1994. J. Bacteriol. 176:143; Greenberg et al 1991. 
J. Bacteriol. 73:4433), and more recently, organic solvents (White et al. 1997. J. of 
Bacteriology 179:6122; Asako, et al 1997. J. Bacteriol. 176:143) and household 
disinfectants, e.g., pine oil and/or TRICLOSAN® ( McMurry et al 1998. FEMS 

35 Microbiology Letters 166:305; Moken et al. 1997. Antimicrobial Agents and 
Chemotherapy 41:2770). 
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The mar locus consists of two divergently positioned transcriptional units 
that flank a common promoter/operator region in E. coli, Salmonella typhimurium, and 
other Entrobacteriacae (Alekshun and Levy. 1997, Antimicrobial Agents and 
Chemother. 41 : 2067). One operon encodes MarC, a putative integral inner membrane 
5 protein without any yet apparent function, but which appears to contribute to the Mar 
phenotype in some strains. The other operon comprises marRAB, encoding the Mar 
repressor (MarR), which binds marO and negatively regulates expression of marRAB 
(Cohen et al. 1994. J. Bacteriol 175:1484; Martin and Rosner 1995. PNAS 92:5456; 
Seoane and Levy. 1995 J. Bacteriol 177:530), an activator (MarA), which controls 

10 expression of other genes on the chromosome, e.g. 9 the mar regulon (Cohen et al. 1994 
J. Bacteriol 175:1484; Gambino et. al. 1993. J. Bacteriol 175:2888; Seoane and Levy, 
1995 J. Bacteriol 177:530), and a putative small protein (MarB) of unknown function. 

Exposure of E. coli to several chemicals, including tetracycline and 
chloramphenicol (Hachler et al 1991 J Bacteriol 173(1 7):5532-8; Ariza, 1994, J 

15 Bacteriol; 176(l):143-8), sodium salicylate and its derivatives (Cohen, 1993, JBacteriol; 
175(24):7856-62) and oxidative stress agents (Seoane et al 1995. J Bacteriol; 
177(12):3414-9) induces the Mar phenotype. Some of these chemicals act directly at the 
level of MarR by interacting with the repressor and inactivating its function (Alekshun. 
1999. J. Bacteriol 181:3303-3306) while others (antibiotics such as tetracycline and 

20 chloramphenicol) appear to induce mar expression by an alternative mechanism 
(Alekshun. 1999. J. Bacteriol 181:3303-3306) e.g., through a signal transduction 
pathway. 

Once expressed, MarA activates the transcription of several genes that constitute 
the E. coli mar regulon (Alekshun, 1997, Antimicrob. Agents Chemother. 41:2067-2075; 

25 Alekshun, 1999, J. Bacteriol 181:3303-3306). With respect to decreased antibiotic 
susceptibility, the increased expression of the AcrAB/TolC multidrug efflux system 
(Fralick, 1996, J Bacteriol 178(19):5803-5; Okusu, 1996 JBacteriol;178(l):306-8) and 
decreased synthesis of OmpF (Cohen, 1988, JBacteriol; 1 70(1 2):541 6-22) an outer 
membrane protein, play major roles. Organic solvent tolerance, however, is attributed to 

30 MarA mediating increased expression of AcrAB, TolC, OmpX, and a 77 kDa protein 
(Aono, 1998, Extremophiles; 2(3):239-48; Aono, 1998 JBacteriol; 180(4):938-44.) but 
is independent of OmpF levels (Asako, 1999, Appl Environ Microbiol; 65(l):294-6). 

MarA is a member of the XylS/AraC family of transcriptional activators 
(Gallegos et al. 1993. Nucleic Acids Res. 21 :807). There are more than 100 proteins 

35 within the XylS/AraC family and a defining characteristic of this group of proteins is the 
presence of two helix-turn-helix (HTH) DNA binding motifs. Proteins within this 
family activate many different genes, some of which produce antibiotic and oxidative 
stress resistance or control microbial metabolism and virulence (Gallegos et al. supra). 
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Summary of the Invention 

The instant invention identifies microbial transcription factors, e.g., 
transcription factors of the AraC-XylS family, as virulence factors in microbes and 
shows that inhibition of these factors reduces the virulence of microbial cells. Because 
5 these transcription factors control virulence, rather than essential cellular processes, the 
development of resistance is much less likely. Accordingly, in one aspect, the invention 
is directed to a method for preventing infection of a subject by a microbe comprising: 
administering a compound that modulates the expression or activity of a microbial t 
transcription factor to a subject at risk of developing an infection such that infection of , 
10 the subj ect is prevented. j 

In one embodiment, the transcription factor is a member of the AraC- 
XylS family of transcription factors. 

In one embodiment, the transcription factor is a member of the MarA 
family of transcription factors. j 
15 In another embodiment, the method further comprises administering an 

antibiotic. 

In another aspect, the invention pertains to a method for preventing 
urinary tract infection of a subject by a microbe comprising: administering a compound | 
that modulates the expression or activity of a microbial transcription factor to a subject j 

i 

20 at risk of developing a urinary tract infection such that infection of the subject is 

prevented. ' 

In yet another aspect, the invention pertains to a method for reducing I 
virulence of a microbe comprising: administering a compound that modulates the j 
expression or activity of a microbial transcription factor to a subject at risk of developing I 
25 an infection with the microbe such that virulence of the microbe is reduced. i 
In one embodiment, the transcription factor is a member of the AraC- 
XylS family of transcription factors. I 
In another embodiment, the transcription factor is a member of the MarA j 
family of transcription factors. j 
30 In yet another embodiment, the method further comprises administering 

an antibiotic. . 

In another aspect, the invention pertains to a method for treating a , 
microbial infection in a subject comprising: administering a compound that modulates 
the expression or activity of a transcription factor to a subject having a microbial 
35 infection such that infection of the subject is treated. 

In one embodiment, the transcription factor is a member of the AraC- 
XylS family of transcription factors. 
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In another embodiment, the transcription factor is a member of the MarA 
family of transcription factors. 

In still another embodiment, the invention further comprises 
administering an antibiotic. 
5 In another aspect, the invention pertains to a method for treating a urinary 

tract infection in a subject comprising: administering a compound that modulates the 
expression or activity of a transcription factor to a subject having a urinary tract infection 
such that infection of the subject is treated. 

In one embodiment, the transcription factor is a member of the AraC- 
1 0 XylS family of transcription factors. 

In one embodiment, the transcription factor is a member of the MarA 
family of transcription factors. 

In another embodiment, the method further comprises administering an 

antibiotic. 

1 5 In another aspect, the invention pertains to a method for reducing 

virulence in a microbe comprising: administering a compound that inhibits the 
expression or activity of a transcription factor to a subject having a microbial infection 
such that virulence of the microbe is reduced. 

In one embodiment, the transcription factor is a member of the AraC- 
20 XylS family of transcription factors. 

In another embodiment, the transcription factor is a member of the MarA 
family of transcription factors. 

In yet another embodiment, the method further comprises administering 

an antibiotic. 

25 In another aspect, the invention pertains to a method for evaluating the 

effectiveness of a compound that modulates the expression or activity of a microbial 
transcription factor at inhibiting microbial virulence comprising: infecting a non-human 
animal with a microbe, wherein the ability of the microbe to establish an infection in the 
non-human animal requires that the microbe colonize the animal; administering the 

30 compound that modulates the expression or activity of the microbial transcription factor 
to the non-human animal; and determining the level of infection of the non-human 
animal, wherein the ability of the compound to reduce the level of infection of the 
animal indicates that the compound is effective at inhibiting microbial virulence. 

In one embodiment, the transcription factor is a member of the AraC- 

35 XylS family of transcription factors. 

In another embodiment, the transcription factor is a member of the MarA 
family of transcription factors. 
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In yet another embodiment, the method further comprises administering 

an antibiotic. 

In still another embodiment, the level of infection of the non-human 
animal is determined by measuring the ability of the microbe to colonize the tissue of the 

5 non-human animal. 

In another embodiment, the level of infection of the non-human animal is 
determined by enumerating the number of microbes present in the tissue of the non- 
human animal. 

In another aspect, the invention pertains to a method for identifying a 
10 compound for treating microbial infection, comprising: innoculating a non-human 

animal with a microbe, wherein the ability of the microbe to establish an infection in the 
non-human animal requires that the microbe colonize the animal; administering a 
compound which reduces the expression or activity of a microbial transcription factor to 
the animal, and determining the effect of the test compound on the ability of the microbe 
15 to colonize the animal, such that a compound for treating microbial infection is 
identified. 

In one embodiment, the transcription factor is a member of the AraO 
XylS family of transcription factors. 

In another embodiment, the transcription factor is a member of the Mar A 
20 family of transcription factors. 

In still another embodiment, the level of infection of the non-human 
animal is determined by measuring the ability of the microbe to colonize the tissue of the 
non-human animal. 

In another embodiment, the level of infection of the non-human animal is 
25 determined by enumerating the number of microbes present in the tissue of the non- 
human animal. 

In another aspect, method for identifying a compound for reducing 
microbial virulence, comprising: inoculating a non-human animal with a microbe, 
wherein the ability of the microbe to establish an infection in the non-human animal 
30 requires that the microbe colonize the animal; administering a compound which reduces 
the expression or activity of a microbial transcription factor to the animal, and 
determining the effect of the test compound on the ability of the microbe to colonize the 
animal, such that a compound for reducing microbial virulence is identified. 

In another embodiment, the transcription factor is a member of the AraC- 
35 XylS family of transcription factors. 

In still another embodiment, the transcription factor is a member of the 
MarA family of transcription factors. 
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In yet another embodiment, the level of infection of the non-human 
animal is determined by measuring the ability of the microbe to colonize the tissue of the 
non-human animal. 

In another embodiment, the level of infection of the non-human animal is 
5 determined by enumerating the number of microbes present in the tissue of the non- 
human animal. 

In another aspect, the invention pertains to a method for identifying transcription factors 
which promote microbial virulence comprising: creating a microbe in which a 
transcription factor to be tested is misexpressed; introducing the microbe into a non- 

10 human animal; wherein the ability of the microbe to establish an infection in the non- 
human animal requires that the microbe colonize the animal; and determining the ability 
of the microbe to colonize the animal, wherein a reduced ability of the microbe to 
colonize the animal as compared to a wild-type microbial cell identifies the transcription 
factor as a transcription factor which promotes microbial virulence. 

15 In another embodiment, the transcription factor is a member of the AraC- 

XylS family of transcription factors. 

In another embodiment, the transcription factor is a member of the MarA 
family of transcription factors. 

In another embodiment, the level of infection of the non-human animal is 

20 determined by measuring the ability of the microbe to colonize the tissue of the non- 
human animal. 

In another embodiment, the level of infection of the non-human animal is 
determined by enumerating the number of microbes present in the tissue of the non- 
human animal. 

25 In another aspect, the invention pertains to a method for reducing the 

ability of a microbe to adhere to an abiotic surface comprising: contacting the abiotic 
surface or the microbe with a compound that modulates the activity of a transcription 
factor such that the ability of the microbe to adhere to the abiotic surface is reduced. 

In one embodiment, the transcription factor is a member of the AraC- 
30 XylS family of transcription factors. 

In another embodiment, the transcription factor is a member of the MarA 
family of transcription factors. 

In yet another embodiment, the method further comprises contacting the 
abiotic surface or the microbe with a second agent that is effective at controlling the 
3 5 growth of the microbe. 

In still another embodiment, the abiotic surface is selected from the group 
consisting of: stents, catheters, and prosthetic devices. 
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In one aspect, the invention pertains to a pharmaceutical composition 
comprising a compound that modulates the activity or expression of a microbial 
transcription factor and a pharmaceutically acceptable carrier, wherein the compound 
reduces microbial virulence. 
5 In another aspect, the invention pertains to a pharmaceutical composition 

comprising a compound that modulates the activity or expression of a microbial 
transcription factor and an antibiotic in a pharmaceutically acceptable carrier. 

The present invention represents atfadvance over the prior art by 
identifying transcription factor modulating compounds, such as, but not limited to helix- 

1 0 turn-helix protein modulating compounds, and providing novel assays that can be used 
to identify compounds which modulate microbial transcription factors, such as MarA 
family polypeptides and AraC family polypeptides. Modulation of gene transcription 
brought about by the modulation of transcription factors, such as helix-tum-helix domain 
containing proteins, can control a wide variety of cellular processes. For example, in 

15 prokaryotic cells processes such as metabolism, resistance, and virulence can be 
controlled. 

Assays to identify compounds that are capable of modulating bacterial 
transcription factors would be of great benefit in the identification of agonists and 
antagonists that can be used to control gene transcription in both prokaryotic and 

20 eukaryotic cells. 

In one embodiment, the invention pertains to a method for reducing 
antibiotic resistance of a cell, e.g., a eukaryotic or prokaryotic cell. In a preferred 
embodiment, the cell is a microbial cell. In one embodiment, the invention pertains to a 
method for reducing antibiotic resistance in a microbial cell, by contacting a cell with a 

25 transcription factor modulating compound, such that the antibiotic resistance of the cell 
is reduced. In an embodiment, the transcription factor modulating compound is of the 
formula (I): 

A-E (I) 
wherein A is a polar moiety, E is a hydrophobic moiety, and pharmaceutically acceptable 
30 salts thereof. 

In another embodiment, the invention pertains to a method for modulating 
transcription. The method includes contacting a transcription factor with a transcription 
factor modulating compound, such that the transcription factor is modulated. The 
transcription factor modulating compound is of the formula (I): 
35 A-E (T) 

wherein A is a polar moiety; and E is a hydrophobic moiety, and pharmaceutically 
acceptable salts thereof. 
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In another embodiment, the invention also includes methods for 
identifying transcription factor modulating compounds. The method includes contacting 
a microbial cell with a test compound under conditions which allow interaction of the 
compound with the microbial cell and measuring the ability of the test compound to 
5 affect the cell. The microbial cell includes a selective marker under the direct control of 
a transcription factor responsive element and a transcription factor. 

In yet another embodiment, the invention includes methods for 
identifying a transcription factor modulating compound. The method includes 
contacting a microbial cell comprising: 1) a selective marker under the control of a 

10 transcription factor responsive element and 2) a transcription factor, with a test 

compound under conditions which allow interaction of the compound with the microbial 
cell, and measuring the ability of the test compound to affect the growth (eg., in vitro or 
in vivo) or survival of the microbial cell, wherein the inactivation of the transcription 
factor leads to a decrease in in vitro or in vivo cell survival. The invention also pertains 

15 to similar methods where the inactivation of the transcription factor leads to an increase 
in cell survival, as well as methods wherein the activation of the transcription factor 
leads to increased or, alternatively, decreased cell survival. 

In another embodiment, the invention also pertains to methods for 
identifying a transcription factor modulating compound, by contacting a microbial cell 

20 comprising: 1) a chromosomal deletion in a guaB or purA gene, 2) heterologous guaB or 
purA gene under the control of its transcription factor responsive promoter, and 3) a 
transcription factor, with a test compound under conditions which allow interaction of 
the compound with the microbial cell. The method further includes the steps of 
measuring the ability of the compound to affect gene expression of the reporter or the 

25 growth or survival of the microbial cell as an indication of whether the compound 

modulates the activity of a transcription factor. The ability of the compound to modulate 
the activity of a transcription factor leads to an alteration in gene expression may effect 
cell growth or survival. 

30 The invention pertains to transcription fector modulating compounds, 

HTH protein modulating compounds, and MarA family modulating compounds 
identified by the methods of the invention, methods of using these compounds and 
pharmaceutical compositions comprising these compounds. The transcription factor 
modulating compounds of the invention include, but are not limited to, compounds of 

35 formulae (I)-(X) and Tables 4, 5, 6, 7 and 8. 

The invention also pertains to methods using computer modeling 
programs to identify transcription factor modulating compounds. For example, the 
invention pertains to a method of identifying transcription factor modulating compounds. 
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The method includes obtaining the structure of the transcription factor modulating 
compound, and using or identifying a scaffold which has an interaction energy score of - 
20 or less with a portion of the transcription factor, thus identifying potential 
transcription factor modulating scaffolds. 
5 The invention also pertains, at least in part, to a kit for identifying a 

transcription factor modulating compound which modulates the activity of a 
, t^T^ctiq^on factor polypeptide comprising a microbial cell. The kit includes 1) a 
. $ selective marker under the control of a transcription factor responsive element and 2) a 
transcription factor. 

10 The invention also pertains, at least in part, to pharmaceutical 

compositions which contain an effective amount of a transcription factor modulating 
compound, and, .optionally, a pharmaceutical^ acceptable carrier. 

The invention also pertains to a method of inhibiting a biofilm, by 
administering a composition comprising a transcription factor modulating compound 

1 5 such that the biofilm is inhibited 

In a further embodiment, the invention pertains to a pharmaceutical 
composition comprising an effective amount of a transcription factor modulating 
compound, and a pharmaceutical^ acceptable carrier. The transcription factor 
modulating compound is of the formula (II): 




20 z 3 ^ a 1 ^ q (H) 

wherein 

WisOorS; 

X is O, S, or C, optionally linked to Q; 
A 1 is C-Z 4 , 0, or S; 
25 A 2 isC-Z 5 ,orN-Z 5 ; 

Z 1 , Z 2 , Z 3 , Z 4 and Z 5 are each independently hydrogen, alkoxy, hydroxy, 
halogen, alkyl, alkenyl, alkynyl, aryl, heterocyclic, amino, or cyano; 

Z 3 is hydrogen, alkoxy, hydroxy, halogen, alkyl, alkenyl, alkynyl, aryl, 
heterocyclic, amino, nitro, cyano, carbonyl, or thiocarbonyl; 
30 Q is an aromatic or heterocyclic moiety, and pharmaceutically acceptable 

salts thereof. 

In another further embodiment, the invention pertains to a pharmaceutical 
composition comprising an effective amount of a transcription factor modulating 
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compound, and a phannaceuticaUy acceptable carrier. The compound is of the formula 
OH): 




wherein 

5 G is substituted or unsubstituted aromatic moiety, heterocyclic, alkyl, 

alkenyl, alkynyl, hydroxy, cyano, nitro, amino, carbonyl, or hydrogen; and 

L l , L 2 , L 3 , L 4 , L 5 , L 6 , L 7 , L 8 , L 9 , and L 10 are each independently oxygen, 
substituted or unsubstituted nitrogen, sulfur and or substituted or unsubstituted carbon, 
and phannaceuticaUy acceptable salts thereof. 
10 In yet another embodiment, the invention pertains to a pharmaceutical 

composition comprising an effective amount of a transcription factor modulating 
compound and a pharmaceutically acceptable carrier. The transcription factor 
modulating compound is of the formula (IV): 




15 wherein 

Y 1 and Y 2 are each oxygen or sulfur; 

J is hydrogen, substituted or unsubstituted alkyl, alkenyl, alkynyl, cyano, 
nitro, amino, or halogen; 

V is substituted or unsubstituted alkyl, alkenyl, alkynyl, alkoxy, 
20 alkylamino, or alkylthio; 

P and K are each independently substituted or unsubstituted aryl, and 
pharmaceutically acceptable salts thereof. 

In another embodiment, the invention pertains to a pharmaceutical 
composition comprising a pharmaceutically acceptable carrier and a transcription factor 
25 modulating compound. The transcription factor modulating compound is of the formula 
(V): 
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V ' (V) 
wherein 

T\ T 2 , T 3 , T 4 , T 5 , and T 6 are each independently substituted or 
unsubstituted carbon, oxygen, substituted or unsubstituted nitrogen, or sulfur; 
5 M is hydrogen, alkyl, alkenyl, heterocyclic, alkynyl, or aryl, or 

phannaceutically acceptable salts thereof. 

In yet another embodiment, the invention pertains to a pharmaceutical 
composition comprising a phannaceutically acceptable carrier and a transcription factor 
modulating compound The transcription factor modulating compound is of the formula 
10 (Va): 




(Va) 
wherein 

R 1 is OH, OCOCO2H, or a substituted or unsubstituted straight or 
15 branched C1-C5 alkyloxy group; 

R 2 is H, C0 2 (C r C5 substituted or unsubstituted, straight or branched 
alkyl), or a substituted or unsubstituted aryl group; and 

R 4 , R 5 , R 6 , and R 7 are independently selected from the group consisting of 
H, (C1-C5 substituted or unsubstituted, straight or branched alkyl), 
20 C0 2 (CrC5 substituted or unsubstituted, straight or branched alkyl), 
CO(Ci-C 5 substituted or unsubstituted, straight or branched alkyl), 
CO(substituted or unsubstituted aryl or heteroaryl), 
CO(C3-C6 substituted or unsubstituted cycloalkyl), 
0(Ci-C 5 substituted or unsubstituted, straight or branched alkyl), 
25 C(NOH)(Ci-C 5 substituted or unsubstituted, straight or branched alkyl), 

substituted or unsubstituted amino, CO2H, CN, N0 2 , CONH 2 , (CO)(NHOH), and 
halogen. 

In certain embodiments of formula Va, those compounds disclosed in 
U.S. 10/139,591, filed May 6, 2002, are excluded from the scope of the present 
30 invention. 
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In other embodiments of formula Va, when R 6 is NO2 and R 2 is 
unsubstituted phenyl, then R 1 is not 0(CHCH 3 )(CO 2 )CH 2 CH3 or OCH2CO2H. Also, in 
another embodiment, when R 6 is H or NO2, then R 1 is not a phenyl-substituted alkyloxy 
group. In yet another embodiment, when R 4 , R 5 , R 6 , and R 7 are all H and R 2 is para- 
5 methoxyphenyl, then R 1 is not OH. And in another embodiment, when R 4 , R 5 , R 6 , and 
R 7 are all H and R 2 is unsubstituted phenyl, then R l is not OCH2CO2CH2CH3; 

In certain aspects of formula Va, R 4 , R 5 , and R 7 are all H. 

Similarly, R 1 of formula Va may be selected from the group consisting of 
OH, 0(CR'R")i- 3 H, 0(CR'R") w OH, 0(CRTT)i-3C0 2 H, O(CR'R")i-3CO 2 (CR'R'0i- 
10 3 H, 0(CR'R^i-3(CO)NH 2 , OCCR'R^wCCNH)^, OCOCO2H, 0(CR'R")i-3S0 3 H, 
0(CR'R")i- 3 OS0 3 H, OCCR'R'OwPOsH, 0(CR'R")i- 3 OP0 3 H, 
0(CR'R")i-3N[(CR'R J V3H] 2 , 0(CR'R")i- 3 (CO)(NHOH), and 0(CR'R")i- 
3 (heteroaryl); 

wherein R' and R" are each independently H, a C1-C3 alkyl, C2-C3 alkenyl, or C2-C3 
1 5 alkynyl group. Each R' and R" is preferably H or CH 3 . 

When R l of formula Va is 0(CR'R")i- 3 (heteroaryl), the heteroaryl group 
maybe a pyrrolyl, furanyl, thiophenyl, thiazolyl, isothiaozolyl, imidazolyl, triazolyl, 
tetrazolyl, pyrazolyl, oxazolyl, isooxazolyl, pyridinyl, pyrazinyl, pyridazinyl, or 
pyrimidinyl group. 

20 Similarly, when R 2 of formula Va may be a substituted or unsubstituted 

phenyl, pyrrolyl, furanyl, thiophenyl, thiazolyl, isothiaozolyl, imidazolyl, triazolyl, 
tetrazolyl, pyrazolyl, oxazolyl, isooxazolyl, pyridinyl, pyrazinyl, pyridazinyl, or 
pyrimidinyl group. 

In a more particular embodiment, R 6 of formula Va is H, (CR'R")i-3H, 

25 (CR'R")i-30H, (CR'R")i- 3 NH 2 , (NOH)(CR'R")i-3H, CO(CR'R")o-3NH 2 , 
CO(CR'R")i-3H, CO(CR'R") w OH, CO(CR'R' V3CF 3 , 
(CR'R 1 V3N[(CR , R ,, )o. 3 H] 2 , CO(substituted or unsubstituted heteroaryl), 
CO(C 3 -C 6 substituted or unsubstituted cycloalkyl), 0(CR'R")i- 3 H, 
CO(substituted or unsubstituted phenyl), C0 2 (CR'R' V3H, CN, N0 2 , F, CI, Br, or I, 

30 wherein R' and R" are each independently H, a Cr-C 3 alkyl, C2-C3 alkenyl, or C2-C3 
alkynyl group. Preferably each R* and R" is independently H or CH 3 . 

In yet another embodiment, R 6 of formula Va is CO(substituted or 
unsubstituted heteroaryl), wherein said heteroaryl group is a pyrrolyl, furanyl, 
thiophenyl, thiazolyl, isothiaozolyl, imidazolyl, triazolyl, tetrazolyl, pyrazolyl, oxazolyl, 

35 isooxazolyl, pyridinyl, pyrazinyl, pyridazinyl, or pyrimidinyl group. 
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In another embodiment, the invention pertains to a pharmaceutical 
composition comprising a pharmaceutical acceptable carrier and a transcription factor 
modulating compound. The transcription factor modulating compound may be of the 
formula (VI): 



5 




wherein 

G l , G 2 , and G 3 are each independently O, S, substituted or unsubstituted 
nitrogen, or substituted or unsubstituted carbon; 

E\ E 2 , and E 3 are each independently hydrogen, alkyl, alkenyl, alkynyl, 
10 aryl, aralkyl, or acyl; and 

E 4 is alkyl, alkenyl, alkynyl, aryl, halogen, cyano, amino, nitro, or acyl, 
and pharmaceutically acceptable salts thereof. 

Brief Description of the Drawings 

1 5 Figure 1 is a multiple sequence alignment of AraC-XylS family 

polypeptides. 

Figure 2 is a multiple sequence alignment of PROSITE PS00041 and 
AraC family polypeptides. 

Figure 3 is a multiple sequence alignment of PROSITE PS01 124 and 
20 AraC family polypeptides. 

Figure 4 is a CoMFA contour map for a representative triazinoxazepine. 

Detailed Description of the Invention 

The instant invention identifies microbial transcription factors, e.g., 
25 transcription factors of the AraC-XylS family, as virulence factors in microbes and 

shows that inhibition of these factors reduces the virulence of microbial cells. Because 
these transcription factors control virulence, rather than essential cellular processes, 
modulation of these factors should not promote resistance. 

Some major families of transcription factors found in bacteria include the 
30 helix-turn-helix transcription factors (HTH) ( Hanison, S. C, and A. K. Aggarwal 1990. 
Annual Review of Biochemistry. 59:933-969) such as AraC, MarA, Rob, SoxS and LysR; 
winged helix transcription factors ( Gajiwala, K. S., and S. K. Burley 2000. 10: 110-1 16), 
e.g., MarR, Sar/Rot family, and OmpR ( Huffinan, J. L., and R. G. Brennan 2002. Curr 
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Opin Struct Biol 12:98-106, Martfaez-Hackert, E., and A. M. Stock 1997. Structure. 
5:109-124); and looped-hinge helix transcription factors (Huffinan, J. L., and R. G. 
Brennan 2002 Carr Opin Struct Biol 12:98-106), e.g. the AbrB protein family. 

The AraC-XylS family of transcription factors comprises many members. 
5 MarA, SoxS, Rma, and Rob are examples of proteins within the AraC-XylS family of 
transcription factors. These factors belong to a subset of the AraC-XylS family that have 
historically been considered to play roles in promoting resistance to multiple antibiotics 
and have not been considered to be virulence factors. In fact, the role of marA in 
virulence has been tested using a marA null mutant of Salmonella enterica serovar 

10 Typhimurium (S. typhimurium) in a mouse infection model ( Sulavik et al. 1997. J. 
Bacteriology 1 79: 1 857) and no such role has been found. In another model (using co- 
infection experiments or crude statistics) only a weak effect of a marA null mutant in 
chickens has been demonstrated (Randall et al. 2001. J. Med. Microbiol. 50:770). In 
contrast to this earlier work, this invention is based, at least in part, on the finding that 

15 the ability of microbes to cause infection in a host can be inhibited by inhibiting the 
expression and/or activity of microbial transcription factors. Thus, the instant invention 
validates the use of microbial transcription factors as therapeutic targets. 

The invention pertains, at least in part, to compounds which modulate 
transcription factors (e.g., helix-turn-helix (HTH) proteins, AraC family polypeptides, 

20 MarA family polypeptides, etc.), methods of identifying the transcription factor 
modulating compounds (e.g., HTH protein modulating compounds, AraC family 
polypeptide modulating compounds, MarA family polypeptide modulating compounds, 
etc.), and methods of using the compounds. 

25 L Transcription factors 

The term 'transcription factor" includes proteins that are involved in gene 
regulation in both prokaryotic and eukaryotic organisms. In one embodiment, 
transcription factors can have a positive effect on gene expression and, thus, may be 
referred to as an "activator" or a 'transcriptional activation factor." In another 

30 embodiment, a transcription factor can negatively effect gene expression and, thus, may 
be referred to as "repressors" or a 'transcription repression factor." Activators and 
repressors are generally used terms and their functions are discerned by those skilled in 
the art. 

As used herein, the term "infectivity" or "virulence" includes the ability 
35 of a pathogenic microbe to colonize a host, a first step required in order to establish 
growth in a host. Infectivity or virulence is required for a microbe to be a pathogen. In 
addition, a virulent microbe is one which can cause a severe infection. As used herein, 
the term "pathogen" includes both obligate and opportunistic organisms. The ability of 
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a microbe to resist antibiotics is also important in promoting growth in a host, however, 
in one embodiment, antibiotic resistance is not included in the terms "infectivity" or 
"virulence" as used herein. Accordingly, in one embodiment, the instant invention 
pertains to methods of reducing the infectivity or virulence of a microbe without 
5 affecting (e.g., increasing or decreasing) antibiotic resistance. Preferably, as used herein, 
the term "infectivity or virulence" includes the ability of an organism to establish itself 
in a host by evading the host's barriers and immunologic defenses. 

The term "AraC family polypeptide," "AraC-XylS family polypeptide" or 
"AraC-XylS family peptide" include an art recognized group of prokaryotic transcription 

10 factors which contains more than 100 different proteins (Gallegos et al, (1997) Micro. 
Mol. Biol. Rev. 61: 393; Martin and Rosner, (2001) Curr. Opin. Microbiol. 4:132). 
AraC family polypeptides include proteins defined in the PROSITE (PS) database 
Oittp://www. exDasv.ch/prositeA as profile PS01 124. The AraC family polypeptides also 
include polypeptides described in PS0041, HTH AraC Family 1, and PS01 124, and HTH 

1 5 AraC Family 2. Multiple sequence alignments for the AraC-XylS family polypeptides, 
HTH AraC family 1, and HTH AraC family 2 are shown in Figures 1-3, respectively. In 
an embodiment, the AraC family polypeptides are generally comprised of, at the level of 
primary sequence, by a conserved stretch of about 100 amino acids, which are believed 
to be responsible for the DNA binding activity of this protein (Gallegos et al, (1997) 

20 Micro. Mol. Biol Rev. 61 : 393; Martin and Rosner, (2001) Curr. Opin. Microbiol 4: 
132). AraC family polypeptides also may include two helix turn helix DNA binding 
motifs (Martin and Rosner, (2001) Curr. Opin. Microbiol. 4: 132; Gallegos et al, (1997) 
Micro. Mol. Biol. Rev. 61: 393; Kwon et al, (2000) Nat. Struct. Biol 7: 424; Rhee et al, 
(1998) Proc. Natl Acad. Sci. U.S.A. 95: 10413). The term includes MarA family 

25 polypeptides and HTH proteins. In one embodiment, the invention pertains to a method 
for modulating an AraC family polypeptide, by contacting the AraC family polypeptide 
with a test compound which interacts with a portion of the polypeptide involved in DNA 
binding. In a further embodiment, the test compound interacts with a conserved 
aminoacid residue (capitalized) of the HTH AraC family 1 protein indicated in Figure 2. 

30 The term "helix-tum-heUx protein," "HTH protein," "helix-tum-helix 

polypeptides," and "HTH polypeptides," includes proteins comprising one or more 
helix-tum-helix domains. Helix-tum-helix domains are known in the art and have been 
imphcated in DNA binding (^/wi?ev. ofBiochem. 1984. 53:293). An example of the 
consensus sequence for a helix-turn domain can be found in Brunelle and Schleif (1989. 

35 J. Mol. Biol. 209:607). The domain has been illustrated by the sequence 

XXXPhoAlaXXPhoGlyPhoXXXXPhoXXPhoXX, where X is any amino acid and Pho is 
a hydrophobic amino acid. 
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The helix-tuni-helix domain was the first DNA-binding protein motif to 
be recognized. Although originally the HTH domain was identified in bacterial proteins, 
the HTH domain has since been found in hundreds of DNA-binding proteins from both 
eukaryotes and prokaryotes. It is constructed from two alpha helices connected by a 
5 short extended chain of amino acids, which constitutes the "turn." 

In one embodiment, a helix-turn-helix domain containing protein is a Mar 
A family polypeptide. The language "MarA family polypeptide" includes the many 
naturally occurring HTH proteins, such as transcription regulation proteins which have 
sequence similarities to MarA and which contain the MarA family signature pattern, 
10 which can also be referred to as an XylS/AraC signature pattern. An exemplary 

signature pattern which defines MarA family polypeptides is shown, e.g., on PROSITE 
and is represented by the sequence: ^<^-[LIVMA]-X(2>^^ 
{FYWPGDN}X(2)-[LIVMSA]-X(4,9)-[IJVMF] 

X(3MGANQRFHLIVM^ 

15 [GSADENQKR]-X.[NSTAPKL]-[PARL], where X is any amino acid. MarA family 
polypeptides have two "helix-turn-helix" domains. This signature pattern was derived 
from the region that follows the first, most amino terminal, helix-turn-helix domain 
(HTH1) and includes the totality of the second, most carboxy terminal helix-tum-helix 
domain (HTH2). (See PROSITE PS00041). 

20 The MarA family of proteins ("MarA family polypeptides") represent one subset 

of AraC-XylS family polypeptides and include proteins like MarA, SoxS, Rob, Rma, 
AarP, PqrA, etc. The MarA family polypeptides, generally, are involved in regulating 
resistance to antibiotics, organic solvents, and oxidative stress agents (Alekshun and 
Levy, (1997) Antimicrob. Agents. Chemother. 41: 2067). Like other AraC-XylS family 

25 polypeptides, MarA-like proteins also generally contain two HTH motifs as exemplified 
by the MarA and Rob crystal structures (Kwon et aL, (2000) Nat. Struct Biol 7: 424; 
Rhee et al, (1998) Proc. Natl. Acad. Sci. U.S.A. 95: 10413). Members of the MarA 
family can be identified by those skilled in the art and will generally be represented by 
proteins with homology to amino acids 30-76 and 77-106 of MarA (SEQ ID. NO. 1). 

30 Preferably, a MarA family polypeptide or portion thereof comprises the 

first MarA family HTH domain (HTH1) (Brunelle, 1989, JMol Biol; 209(4):607-22). In 
another embodiment, a MarA polypeptide comprises the second MarA family HTH 
domain (HTH2) (Caswell, 1992, Biochem J.; 287:493-509.). In apreferred embodiment, 
a MarA polypeptide comprises both the first and second MarA family HTH domains. 

35 MarA family polypeptide sequences are "structurally related" to one or 

more known MarA family members, preferably to MarA. This relatedness can be shown 
by sequence or structural similarity between two MarA family polypeptide sequences or 
between two MarA family nucleotide sequences that specify such polypeptides. 
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Sequence similarity can be shown, e.g., by optimally aligning MarA family member 
sequences using an alignment program for purposes of comparison and comparing 
corresponding positions. To determine the degree of similarity between sequences, they 
will be aligned for optimal comparison purposes (e.g., gaps may be introduced in the 

5 sequence of one protein for nucleic acid molecule for optimal alignment with the other 
protein or nucleic acid molecules). The amino acid residues or bases and corresponding 
amino acid positions or bases are then compared. When a position in one sequence is 
occupied by the same amino acid residue or by the same base as the corresponding 
position in the other sequence, then the molecules are identical at that position. If amino 

1 0 acid residues are not identical, they may be similar. As used herein, an amino acid 
residue is "similar" to another amino acid residue if the two amino acid residues are 
members of the same family of residues having similar side chains. Families of amino 
acid residues having similar side chains have been defined in the art (see, for example, 
Altschul et al. 1990. J. Mol Biol. 215:403) including basic side chains (e.g., lysine, 

15 arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged 
polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, 
cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, 
phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, 
valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan). 

20 The degree (percentage) of similarity between sequences, therefore, is a function of the 
number of identical or similar positions shared by two sequences (i.e., % homology = # 
of identical or similar positions/total # of positions x 100). Alignment strategies are well 
known in the art; see, for example, Altschul et al. supra for optimal sequence alignment. 
MarA family polypeptides may share some amino acid sequence 

25 similarity with MarA. The nucleic acid and amino acid sequences of MarA as well as 
other MarA family polypeptides are available in the art. For example, the nucleic acid 
and amino acid sequence of MarA can be found, e.g., on GeneBank (accession number 
M96235 or in Cohen et al 1993. J. Bacterid 175:1484, or in SEQ ID NO:l and SEQ 
IDNO:2. 

30 The nucleic acid and/or amino acid sequences of MarA can be used as 

"query sequences" to perform a search against databases (e.g., either public or private) 
to, for example, identify other MarA family members having related sequences. Such 
searches can be performed, e.g., using the NBLAST and XBLAST programs (version 
2.0) of Altschul, et al (1990) J. Mol Biol 215:403-10. BLAST nucleotide searches can 

35 be performed with the NBLAST program, score = 100, wordlength = 12 to obtain 
nucleotide sequences homologous to MarA family nucleic acid molecules. BLAST 
protein searches can be performed with the XBLAST program, score = 50, wordlength = 
3 to obtain amino acid sequences homologous to MarA protein molecules of the 
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invention. To obtain gapped alignments for comparison purposes, Gapped BLAST can 
be utilized as described in Altschul et al y (1997) Nucleic Acids Res. 25(17):3389-3402. 
When utilizing BLAST and Gapped BLAST programs, the default parameters of the 
respective programs (e.g., XBLAST and NBLAST) can be used. See 
5 http://www.ncbi.iilm.nih.gov. 

MarA family members can also be identified as being similar based on 
their ability to specifically hybridize to nucleic acid sequences specifying MarA. Such 
stringent conditions are known to those skilled in the art and can be found e.g, in 
Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. 

10 A preferred, non-limiting example of stringent hybridization conditions are hybridization 
in 6X sodium chloride/sodium citrate (SSC) at about 45°C, followed by one or more 
washes in 0.2 X SSC, 0.1% SDS at 50-65°C. Conditions for hybridizations are largely 
dependent on the melting temperature Tm that is observed for half of the molecules of a 
substantially pure population of a double-stranded nucleic acid. Tm is the temperature in 

15 °C at which half the molecules of a given sequence are melted or single-stranded. For 
nucleic acids of sequence 1 1 to 23 bases, the Tm can be estimated in degrees C as 
2(number of A+T residues) + 4(number of C+G residues). Hybridization or annealing of 
nucleic acid molecules should be conducted at a temperature lower than the Tm, e.g., 15 
°C, 20°C, 25°C or 30°C lower than the Tm. The effect of salt concentration (in M of 

20 NaCl) can also be calculated, see for example, Brown, A., "Hybridization" pp. 503-506, 
in The Encyclopedia ofMolec. Biol, J. Kendrew, Ed., Blackwell, Oxford (1994). 

Preferably, the nucleic acid sequence of a MarA family member 
identified in this way is at least about 10%, 20%, more preferably at least about 30%, 
more preferably at least about 40% identical and preferably at least about 50%, or 60% 

25 identical to a MarA nucleotide sequence. In preferred embodiments, the nucleic acid 
sequence of a MarA family member is at least about 70%, 80%, preferably at least about 
90%, more preferably at least about 95% identical with a MarA nucleotide sequence. 
Preferably, MarA family members have an amino acid sequence at least about 20%, 
preferably at least about 30%, more preferably at least about 40% identical and 

30 preferably at least about 50%, or 60% or more identical with a MarA amino acid 
sequence. In preferred embodiments, the nucleic acid sequence of a MarA family 
member is at least about 70%, 80%, more preferably at least about 90%, or more 
preferably at least about 95% identical with a MarA nucleotide sequence. However, it 
will be understood that the level of sequence similarity among microbial regulators of 

35 gene transcription, even though members of the same family, is not necessarily high. 
This is particularly true in the case of divergent genomes where the level of sequence 
identity may be low, eg, less than 20% (eg., B. burgdorferi as compared e.g., to B. 
subtilis). Accordingly, structural similarity among MarA family members can also be 
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determined based on 'three-dimensional correspondence" of amino acid residues. As 
used herein, the language "three-dimensional correspondence" is meant to includes 
residues which spatially correspond, e.g 9 are in the same position of a MarA family 
polypeptide member as determined, e.g., by x-ray crystallography, but which may not 
5 correspond when aligned using a linear alignment program. The language "three- 
dimensional correspondence" also includes residues which perform the same function, 
e.g., bind to DNA or bind the same cofactor, as determined, e.g., by mutational analysis. 

Exemplary MarA family polypeptides are shown in Table 1, Figures 1-3, 
and at Prosite (PS00041) and include: AarP, Ada, AdaA, AdiY, AfrR, AggR, AppY, 

10 AraC, CafR, CelD, CfaD, CsvR, D90812, EnvY, ExsA, FapR, HrpB, InF, InvF, LcrF, 
LumQ, MarA, MelR, MixE, MmsR, MsmR, OrfR, OrfJ375, PchR, PerA, PocR, PqrA, 
RafR, RamA, RhaR, RhaS, Rns, Rob, SoxS, S52856, TetD, TcpN, ThcR, TmbS, 
U73857, U34257, U21191, UreR, VirF, XylR, XylS, Xysl, 2, 3, 4, Ya52, YbbB, YfiF, 
YisR, YzbC, and YijO. The nucleotide and amino acid sequences of the E. coli Rob 

15 molecule are shown in SEQ ID NO:3 and 4, respectively. 



TABLE 1. Some Bacterial MarA homologs a 



Gram-negative bacteria 




Gram-positive bacteria 


Escherichia coli 


Klebsiella pneumoniae 


Lactobacillus helveticus 


MarA(l) 


RamA (27) 


U34257 (38) 


OrfR (2, 3) 






SoxS (4, 5) 


Haemophilus influenzae 


Azorhizobium caulinodans 


AfrR (6) 


Ya52 (28) 


S52856 (39) 


AraC (7) 






CelD (8) 


Yersinia spp. 


Streptomyces spp. 


D90812 (9) 


CafR(29) 


U21191 (40) 


FapR (10,11) 


LcrF (30) or VirF (30) 


AraL(41) 


MelR (12) 






ORF /375 (13, 14) 


Providencia stuartii 


Streptococcus mutans 


RhaR(15,16,17) 


AarP (31) 


MsmR (42) 


RhaS (18) 






Rob (19) 


Pseudomonas spp. 


Pediococcus pentosaceus 


U73857 (20) 


MmsR (32) 


RafR (43) 


XylR (21) 


TmbS (33) 




YijO (22) 


XylS (34) 


Photobacterium leiognathi 




Xysl,2,3,4(35,36) 


LumQ (44) 



-19- 



WO 2004/041209 



PCT/US2003/035205 



Proteus vulgaris 
PqrA(23) 



Cyanobacteria 
Synechocystis spp. 
LumQ (37) 
PchR(37) 



Bacillus subtilis 



Mar A (24) 
InvF (25) 
PocR(26) 



Salmonella typhimurium 



AdaA(45) 
YbbB (46) 
YfiF (47) 
YisR(48) 
YzbC (49) 



a The smaller MarA homologs, ranging in size from 87 (U34257) to 138 (OrfR) amino 
acid residues, are represented in boldface. References are given in parentheses and are 
listed below. 

5 References for Table 1 : 

(1) S.P. Cohen, et al. 1993. / Bacteriol. 175:1484-1492 

(2) G.M. Braus, et al. 1984. J. Bacteriol. 160:504-509 

(3) K. Schollmeier, et al., 1984. J. Bacteriol 160:499-503 

(4) C.F. Amabile-Cuevas, era/., 1991. Nucleic Acids Res. 19:4479-4484 
10 (5) J. Wu, et al., 1991. J. Bacteriol. 173:2864-2871 

(6) M.K. Wolf, et al., 1990. Infect. Immun. 58:1 124-1 128 

(7) CM. Stoner, et al. 1982. J. Mol. Biol. 153:649-652 

(8) L.L. Parker, et al., 1990. Genetics 123:455-471 

(9) H. Mori, 1996. Unpublished data taken from the NCBI databases 
15 (10) P. Klaasen, et al, 1990. Mol. Microbiol. 4:1779-1783 

(1 1) M. Ahmed, et al, 1 994. J. Biol. Chem 269-28506-2851 3 

(12) C. Webster, et al, 1989. Gene 83:207-213 

(13) G. Plunkett, IE. 1995. Unpublished 

(14) CGarcia-Martin,efa/., 1992. J. Gen. Microbiol 138:1109-1116 
20 (15) G. Plunkett, IE., et al. 1993. Nucleic Acids Res. 21:3391-3398 

(16) C. G. Tate, et al. 1 992. J. Biol. Chem. 267:6923-6932 

(17) J.F. Tobin et al., 1987. J. Mol Biol 196:789-799 

(18) J. Nishitani, 1991. Gene 105:37-42 

(19) RE. Benz, et al, 1993. Zentralbl. Bakteriol. Parasitenkd. Infektionskr. Hyg. 
25 Abt.l Orig. 278:187-196 

(20) M. Duncan, et al, 1996. Unpublished data 

(21) HJ. Sofia, et al, 1994. Nucleic Acids Res. 22:2576-2586 

(22) F.R Blattner, et al, 1993. Nucleic Acids Res. 21:5408-5417 

(23) H. Ishida, et al, 1995. Antimicrob. Agents Chemother. 39:453-457 
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(24) M.C. Sulavik, et al, 1997. /. Bacterid. 179:1857-1866 

(25) K. Kaniga, et al, 1994. Mol Microbiol 13:555-568 

(26) J.R. Roth, et al 1993. J. Bacteriol 175:3303-3316 

(27) A.M. George, et al, 1983. /. Bacteriol. 155:541-548 
5 (28) R.D. Heischmann, et al., 1995. Science 269:469-512 

(29) E.E. Galyov, et al., 1991 . FEBSLett. 286:79-82 

(30) N.P. Hoe, et al., 1992. J. Bacteriol. 1 74:4275-4286 

(31) G. Cornells, et al, 1989. J. Bacteriol 171:254-262 

(32) D.R. Macinga, et al, 1995. J. Bacteriol 177:3407-3413 
10 (33) M.I. Steele, et al, 1992. J. Biol. Chem. 267:13585-13592 

(34) G. Deho, et al, 1995. Unpublished data 

(35) N. Mermod, et al, 1984. EMBO J. 3:2461-2466 

(36) S.J. Assinder, et al, 1992. Nucleic Acids Res. 20:5476 

(37) S J. Assinder, et al, 1993. J. Gen. Microbiol 139:557-568 
15 (38) E.G. Dudley, et al, 1996. J. Bacteriol. 178:701-704 

(39) D. Geelen, et al, 1995. Unpublished data 

(40) J. Kormanec, et al, 1995. Gene 165:77-80 

(41) C.W. Chen, et al, 1992. J. Bacteriol. 174:7762-7769 

(42) R.R. Russell, et al, 1992. Biol. Chem, 267:4631-4637 
20 (43) K.K. Leenhouts, et al, 1995. Unpublished data 

(44) J.W. Lin, et al, 1995. Biochem. Biophys. Res. Commun. 217:684-695 

(45) F. Morohoshi, et al 1990. Nucleic Acids Res. 18:5473-5480 

(46) M. Rosenberg, et al, 1979. Annu. Rev. Genet. 13:3 19-353 

(47) H. Yamamoto, et al, 1996. Microbiology 142:1417-1421 
25 (48) L.B. Bussey, et al, 1993. J. Bacteriol 175:6348-6353 

(49) P.G. Quirk, et al, 1994. Biochim. Biophys. Acta 1 1 86:27-34 

The term "transcription factor modulating compound" or transcription 
factor modulator" includes HTH protein modulating compounds, HTH protein 

30 modulators. Transcription factor modulating compounds include compounds which 
interact with one or more transcription factors, such that the activity of the transcription 
factor is modulated, e.g., enhanced or inhibited. The term also includes both AraC 
family modulating compounds and MarA family modulating compounds. In one 
embodiment, the transcription factor modulating compound is an inhibiting compound 

35 of a transcription factor, e.g., a prokaryotic transcription factor or a eukaryotic 

transcription activation factor, m one embodiment, the transcription factor modulating 
compounds modulate the activity of a transcription factor as measured by assays known 
in the art or LANCE assays such as those described in Example 8. m one embodiment, 
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the transcription factor modulating compound inhibits a particular transcription factor by 
about 10% or greater, about 40% or greater, about 50% or greater, about 60% or greater, 
about 70% or greater, about 80% or greater, about 90% or greater, about 95% or greater, 
or about 100% as compared to the activity of the transcription factor with out the 
5 transcription factor modulating compound. In another embodiment, the transcription 

factor modulating compound inhibits biofilm formation. In one embodiment, the j 
transcription factor modulating compound inhibits biofilm formation as measured by 
assays known in the art or the Crystal Violet assay described in Example 7. In one 
embodiment, the transcription factor of the invention inhibits the formation of a biofilm 

i 

10 by about 25% or more, 50% or more, 75% or more, 80% or more, 90% or more, 95% or 
more, 96% or more, 97% or more, 98% or more, 99% or more, 99.9% or more, 99.99% 

i 

or more, or by 1 00%, as compared to the formation of a biofilm without the transcription 
factor modulating compound. 

The term "HTH protein modulating compound" or << HTH protein 

15 modulator" includes compounds which interact with one or more HTH proteins such that 
the activity of the HTH protein is modulated, e.g., enhanced or, inhibited. In one 
embodiment, the HTH protein modulating compound is a MarA family polypeptide 
modulating compound. In one embodiment, the activity of the HTH protein is enhanced 
when it interacts with the HTH protein modulating compound. For example, the activity 

20 of the HTH protein may be increased by greater than 1 0%, greater the 20%, greater than 
50%, greater than 75%, greater than 80%, greater than 90%, or 100% of the activity of 
the HTH protein in the absence of the HTH modulating compound. In another 
embodiment, the activity of the HTH protein is decreased upon an interaction with the 
HTH protein modulating compound. In an embodiment, the activity of the HTH protein 

25 is decreased by about 25% or more, 50% or more, 75% or more, 80% or more, 90% or 
more, 95% or more, 96% or more, 97% or more, 98% or more, 99% or more, 99.9% or 
more, 99.99% or more, or by 100%, as compared to the activity of the protein of a HTH 
protein when not contacted with an HTH modulating compound of the invention using 
techniques and assays described herein. Values and ranges included and/or intermediate ! 

30 of the values set forth herein are also intended to be within the scope of the present 

invention, I 

i 

The term "MarA family polypeptide modulating compound" or "MarA I 
family modulating compound" include compounds which interact with one or more 
35 MarA family polypeptides such that the activity of the MarA family peptide is enhanced 
or inhibited. In an embodiment, the MarA family polypeptide modulating compound is 
an inhibiting compound. In a further embodiment, the MarA family inhibiting 
compound is an inhibitor of MarA, Rob, and/or SoxS. In another embodiment, the 
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MarA family polypeptide modulating compound modulates the expression of luciferase 
in the Luciferase Assay described in Example 9. In one embodiment, the MarA family 
polypeptide modulating compound decreases luciferase expression by greater than 10%, 
greater than 20%, greater than 30%, greater than 40%, greater than 50%, greater than 
5 60%, greater than 70%, greater than 80%, greater than 90% or about 100%. 

The term "polypeptide(s) ff refers to a peptide or protein comprising two or 
more amino acids joined to each other by peptide bonds or modified peptide bonds. 
lf Polypeptide(s)" includes both short chains, commonly referred to as peptides, 
oligopeptides and oligomers and longer chains generally referred to as proteins. 

10 Polypeptides may contain amino acids other than the 20 gene encoded amino acids. 
"Polypeptide(s) 11 include those modified either by natural processes, such as processing 
and other post-translational modifications, but also by chemical modification techniques. 
Such modifications are well described in basic texts and in more detailed monographs, 
as well as in a voluminous research literature, and they are well known to those of skill 

15 in the art. It will be appreciated that the same type of modification may be present in the 
same or varying degree at several sites in a given polypeptide. Also, a given polypeptide 
may contain many types of modifications. Modifications can occur anywhere in a 
polypeptide, including the peptide backbone, the amino acid side-chains, and the amino 
or carboxyl termini. Modifications include, for example, acetylation, acylation, ADP- 

20 ribosylation, amidation, covalent attachment of flavin, covalent attachment of a heme 
moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent 
attachment of a lipid or lipid derivative, covalent attachment of phosphotidylinositol, 
cross-linking, cyclization, disulfide bond formation, demethylation, formation of 
covalent cross-links, formation of cysteine, formation of pyroglutamate, formylation, 

25 gamma-carboxylation, glycosylation, GPI anchor formation, hydroxylation, iodination, 
methylation, myristoylation, oxidation, proteolytic processing, phosphorylation, 
prenylation, racemization, glycosylation, lipid attachment, sulfation, gamma- 
carboxylation of glutamic acid residues, hydroxylation and ADP-ribosylation, 
selenoylation, sulfation, transfer-RNA mediated addition of amino acids to proteins, 

30 such as arginylation, and ubiquitination. See, for instance, Proteins-Structure And 

Molecular Properties, 2 nd Ed., T. E. Creighton, W. H. Freeman and Company, New York 
(1993) and Wold, R, Posttranslational Protein Modifications: Perspectives and 
Prospects, pgs. 1-12 in Posttranslational Covalent Modification Of Proteins, B. C, 
Johnson, Ed., Academic Press, New York (1983); Seifter et al., Meth. Enzymol. 

35 1 82:626-646 (1990) and Rattan et al., Protein Synthesis: Posttranslational Modifications 
and Aging, Ann. N.Y. Acad. Sci. 663: 48-62 (1992). Polypeptides may be branched or 
cyclic, with or without branching. Cyclic, branched and branched circular polypeptides 
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may result from post-translational natural processes and may be made by entirely 
synthetic methods, as well. 

As used herein, the term "winged helix" includes dimeric transcription 
factors in which each monomer comprises a helix-turn-helix motif followed by one or 

5 two p-hairpin wings (Brennan. 1993. Cell 74:773; Gajiwala and Burley. 2000. Curr. 
Opin. Struct Biol 10:110). The classic winged helix motif comprises two wings, three 
a helices, and three p strands in the sequence H1-B1-H2-T-H3-B2-W1-B3-W2 (where H 
is a helix, B is a p strand, T is a turn, and W is a wing), although some variation in 
structure has been demonstrated ( Huffinan and Brennan. 2002. Current Opinion in 

10 Structural Biology, 12:98). 

As used herein the term "looped-hinge helix" included transcription 
factors, such as AbrB which, in the absence of DNA, have revealed a dimeric N-terminal 
region consisting of a four-stranded p sheet and a C-terminal DNA-binding region 
comprising one a helix and a "looped hinge" (see, e.g., Huffinan and Brennan. 2002 

15 Current Opinion in Structural Biology 12:98). Residues corresponding to R23 and R24 
of AbrB are critical for DNA recognition and contribute to the electropositive nature of 
the DNA-binding region. 

Preferred polypeptides (and the nucleic acid molecules that encode them) 
are "naturally occurring." As used herein, a "naturally-occurring" molecule refers to a 

20 molecule having an amino acid or a nucleotide sequence that occurs in nature (e.g. , a 
natural polypeptide). In addition, naturally or non-naturally occurring variants of the 
polypeptides and nucleic acid molecules which retain the same functional activity, (such 
as, the ability to bind to target nucleic acid molecules (e.g., comprising a marbox) or to 
polypeptides (e.g. RNA polymerase) with a naturally occurring polypeptide are provided 

25 for. Such immunologic cross-reactivity can be demonstrated, e.g. , by the ability of a 
variant to bind to a MarA family polypeptide responsive element. Such variants can be 
made, e.g., by mutation using techniques that are known in the art. Alternatively, 
variants can be chemically synthesized. 

As used herein the term "variant(s)" includes nucleic acid molecules or 

30 polypeptides that differ in sequence from a reference nucleic acid molecule or 

polypeptide, but retain its essential properties. Changes in the nucleotide sequence of the 
variant may, or may not, alter the amino acid sequence of a polypeptide encoded by the 
reference nucleic acid molecule. Nucleotide or amino acid changes may result in amino 
acid substitutions, additions, deletions, fusions and truncations in the polypeptide 

35 encoded by a naturally occurring reference sequence. A typical variant of a polypeptide 
differs in amino acid sequence from a reference polypeptide. Generally, differences are 
limited so that the sequences of the reference polypeptide and the variant are closely 
similar overall and, in many regions, identical. A variant and reference polypeptide may 
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differ in amino acid sequence by one or more substitutions, additions, and/or deletions in 
any combination. 

A variant of a nucleic acid molecule or polypeptide may be naturally 
occurring, such as an allelic variant, or it may be a variant that is not known to occur 
5 naturally. Non-naturally occurring variants of nucleic acid molecules and polypeptides 
may be made from a reference nucleic acid molecule or polypeptide by mutagenesis 
techniques, by direct synthesis, and by other recombinant methods known to skilled 
artisans. Alternatively, variants can be chemically synthesized. For instance, artificial or 
mutant forms of autologous polypeptides which are functionally equivalent, (e.g., have 

10 the ability to interact with a MarA family polypeptide responsive element) can be made 
using techniques which are well known in the art. 

Mutations can include, e.g., at least one discrete point mutation which can 
give rise to a substitution, or by at least one deletion or insertion. For example, 
mutations can also be made by random mutagenesis or using cassette mutagenesis. For 

15 the former, the entire coding region of a molecule is mutagenized by one of several 
methods (chemical, PCR, doped oligonucleotide synthesis) and that collection of 
randomly mutated molecules is subjected to selection or screening procedures. In the 
latter, discrete regions of a polypeptide, corresponding either to defined structural or 
functional determinants are subjected to saturating or semi-random mutagenesis and 

20 these mutagenized cassettes are re-introduced into the context of the otherwise wild type 
allele. In one embodiment, PCR mutagenesis can be used. For example, Megaprimer 
PCR can be used (O.H. Landt, 1990. Gene 96:125-128). 

In preferred embodiments, a MarA family polypeptide excludes one or more of 
25 XylS, AraC, and MelR. In other preferred embodiments, a MarA family polypeptide is 
involved in antibiotic resistance. In particularly preferred embodiments, a MarA family 
polypeptide is selected from the group consisting of: MarA, RamA, AarP, Rob, SoxS, 
and PqrA. 

The language "activity of a transcription factor" includes the ability of a 
30 transcription factor to interact with DNA, e.g., to bind to a transcription factor 

responsive promoter, or to initiate transcription from such a promoter. The language 
expressly includes the activities of AraC family polypeptides, HTH proteins and MarA 
family polypeptides. 

The language "activity of a MarA family polypeptide" includes the ability of the 
35 MarA family polypeptide to interact with DNA, e.g. , to bind to a MarA family 
polypeptide responsive promoter, or to initiate transcription from such a promoter. 
MarA functions both as a transcriptional activator (e.g., upregulating genes such as inaA, 
galT, micF, etc.) and as a repressor (e.g. 9 downregulating genes such as fecA, purA, 
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guaB, etc.) (Alekshun, 1 997, Antimicrob. Agents Chemother. 41:2067-2075; Barbosa & 
Levy, J. Bact 2000, Vol 182, p. 3467-3474; Pomposiello et al J. Bact. 2001, Vol 183, 
p. 3890-3902). 

The language "transcription factor responsive element* * includes a nucleic acid 
5 sequence which can interact with a transcription factor (e.g., promoters or enhancers or 
operators) which are involved in initiating transcription of an operon in a microbe. 
Transcription factor responsive elements responsive to various transcription factors are 
known in the art and additional responsive elements can be identified by one of ordinary 
skill in the art. For example, microarray analysis can be used to identify genes that are 
10 regulated by a transcription factor of interest. For interest, genes regulated by a 

transcription factor would be expressed at higher levels in wild type cells than in cells 
which are deleted for the transcription factor. In addition, genes responsive to a given 
transcription factor would comprise one or more target sequences responsive to the 
transcription factor in their promoter regions (Lyons et al. 2000. PNAS 97:7957). 
15 Exemplary responsive elements include: araBAD, araE, araFGH (responsive to AraC); 
melBAD (responsive to MelR); rhaSR (responsive to RhaR); rahBAD, rhaT (responsive 
to RhaS); Pm (responsive to XylS); fumC, inaA, micF, nfo, pai5, sodA, tolC, acrAB, 
fldA, fpr, mar, poxB, ribA, and zwf (responsive to MarA, SoxS, Rob); and coo, rns 
(responsive to Rns). 

20 The language "marA family polypeptide responsive element" includes a nucleic 

acid sequence which can interact with marA, e.g., promoters or enhancers which are 
involved in regulating transcription of a nucleic acid sequence in a microbe. MarA 
responsive elements comprise approximately 16 base pair maibox sequence, the 
sequence critical for the binding of MarA to its target In addition, a secondary site, the 

25 accessory marbox, upstream of the primary maibox contributes to basal and derepressed 
mar transcription. A marbox may be situated in either the forward or backward 
orientation. (Martin, 1999, Mol Microbiol 34:431-441). In the marRAB operon, the 
marbox is in the backward orientation and is thus located on the sense strand with 
respect to marRAB (Martin, 1999, Mol Microbiol 34:431-441). Subtle differences 

30 within the marbox sequence of particular promoters may account for differential 

regulation by MarA and other related, e.g., SoxS and Rob, transcription factors (Martin, 
2000, Mol Microbiol, 35(3):623-34). In one embodiment, MarA family responsive 
elements are promoters that are structurally or functionally related to a marA promoter, 
e.g, interact with MarA or a protein related to MarA. Preferably, the marA family 

35 polypeptide responsive element is a marRAB promoter. For example, in the mar operon, 
several promoters are marA family polypeptide responsive promoters as defined herein, 
e.g., the 405-bp Thai fragment from the marO region is a marA family responsive 
promoter (Cohen et al 1993. J. Bact 175:7856). In addition, MarA has been shown to 
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bind to a 16 bp MarA binding site (referred to as the "marbox" within marO (Martin et 
al 1996. J. Bacterid 178:2216). MarA also affects transcription from the acrAB; 
micF; mlr 1,2,3; sip; nfo; inaA;Jpr; sodA; soi-17J9; zwf;fumC; or rpsF promoters 
(Alekshun and Levy. 1997. Antimicrobial Agents and Chemother. 41:2067). Other 
5 mar A family responsive promoters are known in the art and include: araBAD, araE, 
araFGHand araC, which are activated by AraC; Pm, which is activated by XylS; 
melAB which is activated by MelR; and oriC which is bound by Rob. 

The language "MarA family polypeptide responsive promoter 11 also 
includes portions of the above promoters which are sufficient to activate transcription 

10 upon interaction with a MarA family member protein. The portions of any of the MarA 
family polypeptide-responsi ve promoters which are minimally required for their activity 
can be easily determined by one of ordinary skill in the art, e.g., using mutagenesis. 
Exemplary techniques are described by Gallegos et al (1996, J. Bacterid 178:6427). 
A "MarA family polypeptide responsive promoter" also includes non-naturally occurring 

15 variants of MarA family polypeptide responsive promoters which have the same function 
as naturally occurring MarA family promoters. Preferably such variants have at least 
30% or greater, 40% or greater, or 50% or greater, nucleotide sequence identity with a 
naturally ^occurring MarA family polypeptide responsive promoter. In preferred 
embodiments, such variants have at least about 70% nucleotide sequence identity with a 

20 naturally occurring MarA family polypeptide responsive promoter. In more preferred 
embodiments, such variants have at least about 80% nucleotide sequence identity with a 
naturally occurring MarA family polypeptide responsive promoter. In particularly 
preferred embodiments, such variants have at least about 90% nucleotide sequence 
identity and preferably at least about 95% nucleotide sequence identity with a naturally 

25 occurring MarA family polypeptide responsive promoter. In yet other embodiments 
nucleic acid molecules encoding variants of MarA family polypeptide responsive 
promoters are capable of hybridizing under stringent conditions to nucleic acid 
molecules encoding naturally occurring MarA family polypeptide responsive promoters. 

In one embodiment, the methods described herein can employ molecules 

30 identified as responding to the transcription factors of the invention, i.e., molecules in a 
regulon whose expression is controlled by the transcription factor. For example, 
compounds that modulate transcription of genes that are directly modulated by a 
microbial transcription factor (e.g., a marA family transcription factor) can be used to 
modulate virulence of a microbe or modulate infection by a microbe. In another 

35 embodiment, such genes can be identified as important in controlling virulence using the 
methods described herein. As used herein, the term **regulon" includes two or more loci 
in two or more different operons whose expression is regulated by a common repressor 
or activator protein. 
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The term "interact" includes close contact between molecules that results 
in a measurable effect, e.g., the binding of one molecule with another. For example, a 
MarA family polypeptide can interact with a MarA family polypeptide responsive 
element and alter the level of transcription of DNA. Likewise, compounds can interact 
5 with a MarA family polypeptide and alter the activity of a MarA family polypeptide. 

The term "inducible promoter" includes promoters that are activated to 
induce the synthesis of the genes they control. As used herein, the term "constitutive 
promoter" includes promoters that do not require the presence of an inducer, e.g. t are 
continuously active. 

10 The terms "heterologous DNA" or "heterologous nucleic acid" includes 

DNA that does not occur naturally in the cell (e.g., as part of the genome) in which it is 
present or which is found in a location or locations in the genome that differ from that in 
which it occurs in nature or which is operatively linked to DNA to which it is not 
normally linked in nature (i.e., a gene that has been operatively linked to a heterologous 

15 promoter). Heterologous DNA is 1) not naturally occurring in a particular position {e.g., 
at a particular position in the genome) or 2) is not endogenous to the cell into which it is 
introduced, but has been obtained from another cell. Heterologous DNA can be from the 
same species or from a different species. Any DNA that one of skill in the art would 
recognize or consider as heterologous or foreign to the cell in which is expressed is 

20 herein encompassed by the term heterologous DNA. 

The terms "heterologous protein", "recombinant protein", and "exogenous 
protein" are used interchangeably throughout the specification and refer to a polypeptide 
. which is produced by recombinant DNA techniques, wherein generally, DNA encoding 
the polypeptide is inserted into a suitable expression vector which is in turn used to 

25 transform a host cell to produce the heterologous protein. That is, the polypeptide is 
expressed from a heterologous nucleic acid molecule. 

The term "microbe" includes microorganisms expressing or made to 
express a transcription factor, araC family polypeptide, HTH protein, or a marA family 
polypeptide. "Microbes" are of some economic importance, e.g., are environmentally 

30 important or are important as human pathogens. For example, in one embodiment 
microbes cause environmental problems, e.g., fouling or spoilage, or perform useful 
functions such as breakdown of plant matter. In another embodiment, microbes are 
organisms that live in or on mammals and are medically important Preferably microbes 
are unicellular and include bacteria, fungi, or protozoa. la another embodiment, 

35 microbes suitable for use in the invention are multicellular, e.g., parasites or fimgi. In 
preferred embodiments, microbes are pathogenic for humans, animals, or plants. 
Microbes may be used as intact cells or as sources of materials for cell-free assays. In 
one embodiment, the microbes include prokaryotic organisms. In other embodiments, 
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the microbes include eukaryotic organisms. Exemplary bacteria that comprise MarA 
homologs include the following: 

MarA 

E. coli 

UPEC (uropathogenic) 

EPEC (enteropathogenic) 
Salmonella enterica 

Cholerasuis (septicemia) 

Enteritidis enteritis 

Typhimurium enteritis 

Typhimurium (multi-drug resistant) 
Yersinia enterocolitica 
Yersinia pestis 
Pseudomonas aeruginosa 
Enterobacterspp. 
Klebsiella sp. 
Proteus spp. 
Vibrio cholerae 
Shigella sp. 
Providencia stuartii 
Neisseria meningitidis 
Mycobacterium tuberculosis 
Mycobacterium leprae 
Staphylococcus aureus 
Streptococcus pyogenes 
Enterococcus faecalis 
Bordetella pertussis 

Bordetella bronchiseptica 

The term selective marker includes polypeptides that serve as indicators, 
5 e .g. , provide a selectable or screenable trait when expressed by a cell. The term 

"selective marker" includes both selectable markers and counterselectable markers. As 
used herein the term "selectable marker" includes markers that result in a growth 
advantage when a compound or molecule that fulfills the test parameter of the assay is 
present. The term "counterselectable marker" includes markers that result in a growth 
1 0 disadvantage unless a compound or molecule is present which disrupts a condition 

giving rise to expression of the counterselectable marker. Exemplary selective markers 
include cytotoxic gene products, gene products that confer antibiotic resistance, gene 
products that are essential for growth, gene products that confer a selective growth 
disadvantage when expressed in the presence of a particular metabolic substrate (e.g., the 
1 5 expression of the URA3 gene confers a growth disadvantage in the presence of 5- 
fluoroorotic acid). 
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As used herein the tenia "reporter gene" includes any gene which encodes an 
easily detectable product which is operably linked to a regulatory sequence, e.g., to a 
transcription factor responsive promoter. By operably linked it is meant that under 
appropriate conditions an RNA polymerase may bind to the promoter of the regulatory 
5 region and proceed to transcribe the nucleotide sequence such that the reporter gene is 
transcribed- In preferred embodiments, a reporter gene consists of the transcription 
factor responsive promoter linked in frame to the reporter gene. In certain embodiments, 
however, it may be desirable to include other sequences, e.g, transcriptional regulatory 
sequences, in the reporter gene construct. For example, modulation of the activity of the 

1 0 promoter may be effected by altering the RNA polymerase binding to the promoter 

region, or, alternatively, by interfering with initiation of transcription or elongation of the 
mRNA. Thus, sequences which are herein collectively referred to as transcriptional 
regulatory elements or sequences may also be included in the reporter gene construct. In 
addition, the construct may include sequences of nucleotides that alter translation of the 

15 resulting mRNA, thereby altering the amount of reporter gene product. 

Examples of reporter genes include, but are not limited to CAT (chloramphenicol 
acetyl transferase) (Alton and Vapnek (1979), Nature 282: 864-869) luciferase, and other 
enzyme detection systems, such as beta-galactosidase; firefly luciferase (deWet et al. 
(1987), Mol Cell Biol 7:725-737); bacterial luciferase (Engebrecht and Silverman 

20 (1984), PNAS 1: 4154-4158; Baldwin et al. (1984), Biochemistry 23: 3663-3667); PhoA, 
alkaline phosphatase (Toh et al. (1989) Eur. J. Biochem. 182: 231-238, Hall et al. (1983) 
J. Mol. Appl. Gen. 2: 101), human placental secreted alkaline phosphatase (Cullen and 
Malim (1992) Methods in Enzymol 216:362-368) and green fluorescent protein (U.S. 
patent 5,491,084; W096/23898). 

25 In certain embodiments of the invention it will be desirable to obtain "isolated or 

recombinant" nucleic acid molecules transcription factors or mutant forms thereof. The 
term "isolated or recombinant" includes nucleic acid molecules which have been, e.g., 
(1) amplified in vitro by, for example, polymerase chain reaction (PCR); (2) 
recombinantly produced by cloning, or (3) purified, as by cleavage and gel separation; or 

30 (4) synthesized by, for example, chemical synthesis. Such a nucleic acid molecule is 
isolated from the sequences which naturally flank it in the genome and from cellular 
components. 

In yet other embodiments of the invention, it will be desirable to obtain a 
substantially purified or recombinant transcription factor. Such polypeptides, for 
35 example, can be purified from cells which have been engineered to express an isolated 
or recombinant nucleic acid molecule which encodes a transcription factor. For 
example, as described in more detail below, a bacterial cell can be transformed with a 
plasmid which encodes a transcription factor. The transcription factor can then be 
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purified from the bacterial cells and used, for example, in the cell-free assays described 
herein or known in the art. 

As used herein, the term "antibiotic" includes antimicrobial agents isolated from 
natural sources or chemically synthesized. The term "antibiotic" refers to antimicrobial 
5 agents for use in human therapy. Preferred antibiotics include: tetracycline, 

fluoroquinolones, chloramphenicol, penicillins, cephalosporins, puromycin, nalidixic 

acid, and rifampin. 

The term "test compound" includes any reagent or test agent which is 
employed in the assays of the invention and assayed for its ability to influence the 
10 activity of a transcription factor, e.g., an AraC family polypeptide, an HTH protein, or a 
Mar A family polypeptide, e.g, by binding to the polypeptide or to a molecule with 
which it interacts. More than one compound, e.g., a plurality of compounds, can be 
tested at the same time for their ability to modulate the activity of a transcription factor, 
e.g., an AraC family polypeptide, an HTH protein, or a MarA family polypeptide, 
15 activity in a screening assay. In an advantageous embodiment, the test compound is a 
MarA family modulating compound- 
Test compounds that can be tested in the subject assays include antibiotic 
and non-antibiotic compounds. In one embodiment, test compounds include candidate 
detergent or disinfectant compounds. Exemplary test compounds which can be screened 
20 for activity include, but are not limited to, peptides, non-peptidic compounds, nucleic 
acids, carbohydrates, small organic molecules {e.g., polyketides), and natural product 
extract libraries. The term "non-peptidic test compound" includes compounds that are 
comprised, at least in part, of molecular structures different from naturally-occurring L- 
amino acid residues linked by natural peptide bonds. However, "non-peptidic test 
25 compounds" also include compounds composed, in whole or in part, of peptidomimetic 
structures, such as D-amino acids, non-naturally-occurring L-amino acids, modified 
peptide backbones and the like, as well as compounds that are composed, in whole or in 
part, of molecular structures unrelated to naturally-occurring L-amino acid residues 
linked by natural peptide bonds. "Non-peptidic test compounds" also are intended to 
30 include natural products. 

In one embodiment, small molecules can be used as test compounds. The 
term "small molecule" is a term of the art and includes molecules that are less than about 
1000 molecular weight or less than about 500 molecular weight. In one embodiment, 
small molecules do not exclusively comprise peptide bonds. In another embodiment, 
35 small molecules are not oligomeric. Exemplary small molecule compounds which can 
be screened for activity include, but are not limited to, peptides, peptidomimetics, 
nucleic acids, carbohydrates, small organic molecules (e.g., polyketides) (Cane et al. 
1998. Science 282:63), and natural product extract libraries. In another embodiment, the 
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compounds are small, organic non-peptidic compounds. In a further embodiment, a 
small molecule is not biosynthetic. 

The term "antagonist" includes transcription factor modulating 
compounds (e.g., AraC family polypeptide modulating compounds, HTH protein 
5 modulating compounds, MarA family polypeptide modulating compounds, etc.) which 
inhibit the activity of a transcription factor by binding to and inactivating the 
transcription factor (e.g., an AraC family modulating compound, an MarA family 
polypeptide modulating compound, etc.), by binding to a nucleic acid target with which 
the transcription factor interacts (e.g. , for MarA, a marbox), by disrupting a signal 

10 transduction pathway responsible for activation of a particular regulon (e.g., for Mar, the 
inactivation of MarR or activation of MarA synthesis), and/or by disrupting a critical 
protein-protein interaction (e.g., MarA-RNA polymerase interactions that are required 
for MarA to function as a transcription factor.) Antagonists may include, for example, 
naturally or chemically synthesized compounds such as small cell permeable organic 

1 5 molecules, nucleic acid interchelators, peptides, etc. 

The term "agonist" includes transcription factor modulating compounds 
(e.g., AraC family polypeptide modulating compounds, HTH protein modulating 
compounds, MarA family polypeptide modulating compounds, etc.) which promote the 
activity of a transcription factor by binding to and activating the transcription factor (e.g., 

20 an AraC family modulating compound, an MarA family polypeptide modulating 

compound, etc.), by binding to a nucleic acid target with which the transcription factor 
interacts (e.g., for MarA, a marbox), by facilitating a signal transduction pathway 
responsible for activation of a particular regulon (e.g. , for Mar, the inactivation of MarR 
or activation of MarA synthesis), and/or by facilitating a critical protein-protein 

25 interaction (e.g., MarA-RNA polymerase interactions that are required for MarA to 
function as a transcription factor.) Agonists may include, for example, naturally or 
chemically synthesized compounds such as small cell permeable organic molecules, 
nucleic acid interchelators, peptides, etc. 

30 II. MarA Family polypeptide Helix-Turn-Helix Domains 

Helix-turn-helix domains are known in the art and have been implicated 

in DNA binding (Ann Rev. of Biochem. 1984. 53:293). An example of the consensus 

sequence for a helix-tum domain can be found in Brunelle and Schleif (1989, J. Mol. 

Biol. 209:607). The domain has been illustrated by the sequence 
35 XXXPhoAlaXXPhoGlyPhoXXXXPhoXXPhoXX, where X is any amino acid and Pho is 

a hydrophobic amino acid. 
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The crystal structure of Mar A has been determined and the first (most 
amino terminal) HTH domain of MarA has been identified as comprising from about 
amino acid 3 1 to about amino acid 52 and the second HTH domain of MarA has been 
identified as comprising from about amino acid 79 to about amino acid 102 (Rhee et al. 
5 1998. Proc. Natl. Acad. Set USA. 95:10413). 

Locations of the helix-turn-helix domains in other MarA family members 
as well as other HTH proteins can easily be found by one of skill in the art. For example 
using the MarA protein sequence and an alignment program, e.g., the ProDom program 
or other programs known in the art, a portion of the MarA amino acid sequence e.g., 

10 comprising one or both HTH domains of MarA (such as from about amino acid 30 to 
about amino acid 107 of MarA) to produce an alignment. Using such an alignment, the 
amino acid sequences corresponding to the HTH domains of MarA can be identified in 
other MarA family member proteins. An exemplary consensus sequence for the first 
helix-turn-helix domain of a MarA family polypeptide can be illustrated as 

15 XXXXAXXXXXSXXXLXXXFX, where X is any amino acid. An exemplary 
consensus sequence for the second helix-turn-helix domain of a MarA family 
polypeptide is illustrated as XXIXXIAXXXGFXSXXXFXXX[F/Y], where X is any 
amino acid. Preferably, a MarA family polypeptide first helix-turn-helix domain 
comprises the consensus sequence E/D-X-V/L-A-D/E-X-A/S-G-X-S-X3-L-Q-X2-F- 

20 K/R/E-X2-T/L Preferably, a MarA family polypeptide second helix-turn-helix domain 
comprises the consensus sequence I-X-D-I-A-X3-G-F-X-S-X2-F-X3-F-X4. 

In an embodiment, a MarA family member HTH domain is a MarA HTH 
domain. The first and second helix-turn-helix domains of MarA are, respectively, 
EKVSERSGYSKWHLQRMFKKET and ILYLAERYGFESQQTLTRTFKNYF. Other 

25 exemplary MarA family helix-turn-helix domains include: about amino acid 210 to 

about amino acid 229 and about amino acid 259 to about amino acid 278 of MelR; about 
amino acid 196 to about amino acid 215 and about amino acid 245 to about amino acid 
264 of AraC; and about amino acid 230 to about amino acid 249 (or 233-253) and about 
amino acid 281 to about amino acid 301 (or 282-302) of XylS (see e.g., Brunelle et al 

30 1989. 1 Mol Biol 209:607; Niland et al. 1996. J. Mol. Biol. 264:667; Gallegos et al 
1997. Microbiology and Molecular Biology Reviews. 61:393). 

"MarA family polypeptide helix-turn-helix domains" are derived from or 
are homologous to the helix-turn-helix domains found in the MarA family polypeptides 
as described supra. In preferred embodiments, a MarA family polypeptide excludes one 

35 or more of XylS, AraC, and MelR. In particularly preferred embodiments, a MarA 
family polypeptide is selected from the group consisting of: MarA, RamA, AarP, Rob, 
SoxS, and PqrA. 



-33- 



WO 2004/041209 



PCT/US2003/035205 



Both of the helix-tum-helix domains present in Mar A family polypeptides 
are in the carboxy terminal end of the protein. Proteins or portions thereof comprising 
either or both of these domains can be used in the instant methods. In certain 
embodiments, a polypeptide which is used in screening for compounds comprises the 

5 helix-tum-helix domain most proximal to the carboxy terminus (HTH2) of the MarA 
family polypeptide from which it is derived. In other embodiments, such a polypeptide 
comprises the helix-tum-helix domain most proximal to the amino terminus (HTH1) of 
the MarA family polypeptide from which it is derived. In one embodiment, other 
polypeptide sequences may also be present, e.g., sequences that migiht facilitate 

10 immobilizing the domain on a support, or, alternatively, might facilitate the purification 
of the domain. 

In an embodiment, such a polypeptide consists essentially of the helix- 
turn-helix domain most proximal to the carboxy terminus of the MarA family 
polypeptide from which it is derived. In other preferred embodiments, such a 

15 polypeptide consists essentially of the helix-turn-helix domain most proximal to the 
amino terminus of the MarA family polypeptide from which it is derived. 

In an embodiment, such a polypeptide consists of the helix-tum-helix 
domain most proximal to the carboxy terminus of the AraC family polypeptide or MarA 
family polypeptide from which it is derived. In other preferred embodiments, such a 

20 polypeptide consists of the helix-tum-helix domain most proximal to the amino terminus 
of the AraC family polypeptide or MarA family polypeptide from which it is derived. 

MarA family polypeptide or AraC family polypeptide helix-turn-helix 
domains can be made using techniques which are known in the art. The nucleic acid and 
amino acid sequences of transcription factors, such as MarA family polypeptides, are 

25 available, for example, from GenBank. Using this information, the helix-tum-helix 
consensus motif and mutational analysis provided herein, one of ordinary skill in the art 
can identify MarA family or AraC family polypeptide helix-tum-helix domains. 

In certain embodiments of the invention it will be desirable to obtain 
"isolated or recombinant" nucleic acid molecules encoding transcription factors or 

30 portions thereof (e.g., HTH protein helix-tum-helix domains, AraC family helix-tum- 
helix domains, MarA family helix-tum-helix domains or mutant forms thereof). By 
"isolated or recombinant" is meant a nucleic acid molecule which has been (1) amplified 
in vitro by, for example, polymerase chain reaction (PCR); (2) recombinant!/ produced 
by cloning, or (3) purified, as by cleavage and gel separation; or (4) synthesized by, for 

35 example, chemical synthesis. Such a nucleic acid molecule is isolated from the 
sequences which naturally flank it in the genome and from cellular components. 
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The isolated or recombinant nucleic acid molecules encoding 
transcription factors (e.g., HTH protein helix-tum-helix domains, AraC family helix- 
tum-helix domains, MarA family helix-tum-helix domains or mutant forms thereof) can 
then, for example, be utilized in binding assays, can be expressed in a cell, or can be 
5 expressed on the surface of phage, as discussed further below. 

In yet other embodiments of the invention, it will be desirable to obtain a 
substantially purified or recombinant HTH protein helix-tum-helix domains (e.g., MarA 
family helix-tum-helix domains or mutant forms thereof). Such polypeptides, for 
example, can be purified from cells which have been engineered to express an isolated or 

10 recombinant nucleic acid molecule which encodes a HTH protein helix-tum-helix 
domain (e.g., MarA family helix-tum-helix domain or mutant forms thereof)- For 
example, as described in more detail below, a bacterial cell can be transformed with a 
plasmid which encodes a MarA family helix-tum-helix domain. The MarA family helix- 
tum-helix protein can then be purified from the bacterial cells and used, for example, in 

15 the cell-free assays described herein. 

Purification of a HTH protein helix-tum-helix domain (e.g., MarA family 
helix-tum-helix domain) can be accomplished using techniques known in the art. For 
example, column chromatography could be used, or antibodies specific for the domain or 
for a polypeptide fused to the domain can be employed, for example on a column or in a 

20 panning assay. 

In preferred embodiments, cells used to express HTH protein helix-tum- 
helix domains (e.g., MarA family helix-tum-helix domains or mutant forms thereof) for 
purification, e.g., host cells, comprise a mutation which renders any endogenous HTH 
proteins nonfunctional or causes the endogenous protein to not be expressed. In other 
25 embodiments, mutations may also be made in MarR or related genes of the host cell, 
such that repressor proteins which bind to the same promoter as a MarA family 
polypeptide are not expressed by the host cell. 

In certain embodiments of the invention, it will be desirable to use a 
mutant form of a HTH protein helix-turn helix domain, e.g., a non-naturally occurring 
30 form of a MarA family helix-tum-helix domain which has altered activity, e.g., does not 
retain wild type MarA family polypeptide helix-tum-helix domain activity, or which has 
reduced activity or which is more active when compared to a wild-type MarA family 
polypeptide helix-tum-helix domain. 

Such mutant forms can be made using techniques which are well known 
35 in the art. For example, random mutagenesis can be used. Using random mutagenesis 
one can mutagenize an entire molecule or one can proceed by cassette mutagenesis. In 
the former instance, the entire coding region of a molecule is mutagenized by one of 
several methods (chemical, PCR, doped oligonucleotide synthesis) and that collection of 

-35- 



WO 2004/041209 



PCT/US2003/035205 



randomly mutated molecules is subjected to selection or screening procedures. In the 
second approach, discrete regions of a protein, corresponding either to defined structural 
or functional determinants (e.g., the first or second alpha helix of a helix-turn-helix 
domain) are subjected to saturating or semi-random mutagenesis and these mutagenized 

5 cassettes are re-introduced into the context of the otherwise wild type allele. 

In a preferred embodiment, PCR mutagenesis is used. For example, 
Example 2 describes the use of Megaprimer PCR (O.H. Landt, Gene 96: 125-128) used 
to introduce an NJiel restriction site into the centers of both the helix A (position 1989) 
and helix B (position 2016) regions of the marA gene. 

10 In one embodiment, such mutant helix-turn-helix domains comprise one 

or more mutations in the helix-turn-helix domain most proximal to the carboxy terminus 
(HTH2) of the MarA family polypeptide molecule. In a preferred embodiment, the 
mutation comprises an insertion into helix A and helix B of the helix-turn-helix domain 
most proximal to the carboxy terminus of the MarA family polypeptide. In one 

15 embodiment, such mutant helix-turn-helix domains comprise one or more mutations in 
the helix-turn-helix domain most proximal to the amino terminus (HTH1) of the MarA 
family polypeptide molecule. In a preferred embodiment, the mutation comprises an 
insertion into helix A and helix B of the helix-turn-helix domain most proximal to the 
amino terminus of the MarA family polypeptide. In particularly preferred embodiments, 

20 the mutation is selected from the group consisting of: an insertion at an amino acid 
corresponding to about position 33 of MarA and an insertion at an amino acid position 
corresponding to about position 42 of MarA. "Corresponding" amino acids can be 
determined, e.g., using an alignment of the helix-turn-helix domains. 

Such mutant forms of MarA family heUx-turn-helix motifs are useful as 

25 controls to verify the specificity of antiinfective compounds for a MarA family helix- 
turn-helix domain or as controls for the identification of genetic loci which affect 
resistance to antiinfectives. For example, the mutant MarA family helix-turn-helix 
domains described in the appended Examples demonstrate that insertional inactivation of 
MarA at either helix A or helix B in the first HTH domain abolished the multidrug 

30 resistance phenotype in both E. coli and M smegmatis. By the use of an assay system 
such as that described in Example 2, which demonstrates the ability of MarA family 
polypeptide helix-tum-helix domains to increase antibiotic resistance and that mutant 
forms of these domains do not have the same effect, one can clearly show that the 
response of any genetic loci identified is specific to a MarA family helix-turn-helix 

35 domain. 
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The isolated or recombinant nucleic acid molecules encoding 
transcription factors (e.g., HTH protein helix-turn-helix domains, AraC family helix- 
turn-helix domains, MarA family helix-turn-helix domains or mutant forms thereof) can 
then, for example, be utilized in binding assays, can be expressed in a cell, or can be 
5 expressed on the surface of phage, as discussed further below. 

In yet other embodiments of the invention, it will be desirable to obtain a 
substantially purified or recombinant HTH protein helix-turn-helix domains (e.g., MarA 
family helix-turn-helix domains or mutant forms thereof). Such polypeptides, for 
example, can be purified from cells which have been engineered to express an isolated or 

10 recombinant nucleic acid molecule which encodes a HTH protein helix-turn-helix 
domain (e.g., MarA family helix-turn-helix domain or mutant forms thereof). For 
example, as described in more detail below, a bacterial cell can be transformed with a 
plasmid which encodes a MarA family helix-turn-helix domain. The MarA family helix- 
turn-helix protein can then be purified from the bacterial cells and used, for example, in 

15 the cell-free assays described herein. 

Purification of a HTH protein helix-turn-helix domain (e.g., MarA family 
helix-turn-helix domain) can be accomplished using techniques known in the art. For 
example, column chromatography could be used, or antibodies specific for the domain or 
for a polypeptide fused to the domain can be employed, for example on a column or in a 

20 panning assay. 

In preferred embodiments, cells used to express HTH protein helix-turn- 
helix domains (e.g., MarA family helix-turn-helix domains or mutant forms thereof) for 
purification, e.g., host cells, comprise a mutation which renders any endogenous HTH 
proteins nonfunctional or causes the endogenous protein to not be expressed. In other 
25 embodiments, mutations may also be made in MarR or related genes of the host cell, 
such that repressor proteins which bind to the same promoter as a MarA family 
polypeptide are not expressed by the host cell. 

In certain embodiments of the invention, it will be desirable to use a 
mutant form of a HTH protein helix-turn helix domain, e.g., a non-naturally occurring 
30 form of a MarA family helix-turn-helix domain which has altered activity, e.g., does not 
retain wild type MarA family polypeptide helix-turn-helix domain activity, or which has 
reduced activity or which is more active when compared to a wild-type MarA family 
polypeptide helix-turn-helix domain. 

Such mutant forms can be made using techniques which are well known 
35 in the art. For example, random mutagenesis can be used. Using random mutagenesis 
one can mutagenize an entire molecule or one can proceed by cassette mutagenesis. In 
the former instance, the entire coding region of a molecule is mutagenized by one of 
several methods (chemical, PCR, doped oligonucleotide synthesis) and that collection of 
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1/7. Expression of Polypeptide or Portions Thereof 

Nucleic acids encoding transcription factors, such as AraC family 
polypeptides, HTH proteins, e.g., MarA family polypeptides or selectable markers (or 
portions thereof that retain an activity of the full-length polypeptide, e.g., are capable of 
5 binding to a transcription factor responsive element or retain their indicator function) can 
be expressed in cells using vectors. Almost any conventional delivery vector can be 
used. Such vectors are widely available commercially and it is within the knowledge 
and discretion of one of ordinary skill in the art to choose a vector which is appropriate 
for use with a given microbial cell. The sequences encoding these domains can be 

1 0 introduced into a cell on a self-replicating vector or may be introduced into the 

chromosome of a microbe using homologous recombination or by an insertion element 
such as a transposon. 

These nucleic acids can be introduced into microbial cells using standard 
techniques, for example, by transformation using calcium chloride or electroporation. 

1 5 Such techniques for the introduction of DNA into microbes are well known in the art. 
In one embodiment, a nucleic acid molecule which has been amplified in vitro by, for 
example, polymerase chain reaction (PCR); recombinantly produced by cloning, or) 
purified, as by cleavage and gel separation; or synthesized by, for example, chemical 
synthesis can be used to produce MarA family polypeptides (George, A. M. & Levy, S. 

20 B. (1983)J. Bacteriol 155, 541-548; Cohen, S. P. et al. (1993) J Infect. Dis. 168, 

484-488; Cohen, S. P etal (1993) J Bacteriol 175, 1484-1492; Sulavick, M. C. etal 
(1997) J. Bacteriol 179, 1857-1866). 

Host cells can be genetically engineered to incorporate nucleic acid molecules of 
the invention. In one embodiment nucleic acid molecules specifying transcription 

25 factors can be placed in a vector. The term "vector" refers to a nucleic acid molecule 
capable of transporting another nucleic acid molecule to which it has been linked. The 
term "expression vector" or "expression system" includes any vector, (e.g, a plasmid, 
cosmid or phage chromosome) containing a gene construct in a form suitable for 
expression by a cell (e.g., linked to a promoter). In the present specification, "plasmid" 

30 and "vector" are used interchangeably, as a plasmid is a commonly used form of vector. 
Moreover, the invention is intended to include other vectors which serve equivalent 
functions. A great variety of expression systems can be used to produce the polypeptides 
of the invention. Such vectors include, among others, chromosomal, episomal and virus- 
derived vectors, e.g., vectors derived from bacterial plasmids, from bacteriophage, from 

35 transposons, from yeast episomes, from insertion elements, from yeast chromosomal 
elements, from viruses such as baculoviruses, papova viruses, such as SV40, vaccinia 
viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and 
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vectors derived from combinations thereof, such as those derived from plasmid and 
bacteriophage genetic elements, such as cosmids and phagemids. 

Appropriate vectors are widely available commercially and it is within the 
knowledge and discretion of one of ordinary skill in the art to choose a vector which is 

5 appropriate for use with a given host cell. The sequences encoding a transcription factor, 
such as, for example, MarA family polypeptides, can be introduced into a cell on a self- 
replicating vector or may be introduced into the chromosome of a microbe using 
homologous recombination or by an insertion element such as a transposon. 
The expression system constructs may contain control regions that regulate 

10 expression. "Transcriptional regulatory sequence" is a generic term to refer to DNA 
sequences, such as initiation signals, enhancers, operators, and promoters, which induce 
or control transcription of polypeptide coding sequences with which they are operably 
linked. It will also be understood that a recombinant gene encoding a transcription 
factor gene, e.g., an HTH protein gene or an AraC family polypeptide, e.g., MarA family 

1 5 polypeptide, can be under the control of transcriptional regulatory sequences which are 
the same or which are different from those sequences which control transcription of the 
naturally-occurring transcription factor gene. Exemplary regulatory sequences are 
described in Goeddel; Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, CA (1990). For instance, any of a wide variety of 

20 expression control sequences, that control the expression of a DNA sequence when 
operatively linked to it, may be used in these vectors to express DNA sequences 
encoding the polypeptide. 

Generally, any system or vector suitable to maintain, propagate or express nucleic 
acid molecules and/or to express a polypeptide in a host may be used for expression in 

25 this regard. The appropriate DNA sequence may be inserted into the expression system 
by any of a variety of well-known and routine techniques, such as, for example, those set 
forth in Sambrook et al., Molecular Cloning, A Laboratory Manual, (supra). 

Exemplary expression vectors for expression of a gene encoding a polypeptide 
and capable of replication in a bacterium, e.g 9 a gram positive, gram negative, or in a 

30 cell of a simple eukaryotic fungus such as a Saccharomyces or, Pichia, or in a cell of a 
eukaryotic organism such as an insect, a bird, a mammal, or a plant, are known in the art. 
Such vectors may carry functional replication-specifying sequences (replicons) both for a 
host for expression, for example a Streptomyces, and for a host, for example, E. colU for 
genetic manipulations and vector construction. See, e.g. 9 U.S. 4,745,056. Suitable 

35 vectors for a variety of organisms are described in Ausubel, F. et al> Short Protocols in 
Molecular Biology, Wiley, New York (1 995), and for example, for Pichia, can be 
obtained from Invitrogen (Carlsbad, CA). 
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Useful expression control sequences, include, for example, the early and late 
promoters of SV40, adenovirus or cytomegalovirus immediate early promoter, the lac 
system, the trp system, the TAC or TRC system, T7 promoter whose expression is 
directed by T7 RNA polymerase, the major operator and promoter regions of phage 
5 lambda , the control regions for fd coat polypeptide, the promoter for 3- 

phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid phosphatase, 
e.g., Pho5, the promoters of the yeast cc-mating factors, the polyhedron promoter of the 
baculovirus system and other sequences known to control the expression of genes of 
prokaryotic or eukaryotic cells or their viruses, and various combinations thereof. A 

10 useful translational enhancer sequence is described in U.S. 4,820,639. 

In one embodiment, an inducible promoter will be employed to express a 
polypeptide of the invention. For example, in one embodiment, trp (induced by 
tryptophan), tac (induced by lactose), or tet (induced by tetracycline) can be used in 
bacterial cells, or GAL1 (induced by galactose) can be used in yeast cell. 

15 In another embodiment, a constitutive promoter can be used to express a 

polypeptide of the invention! 

It should be understood that the design of the expression vector may depend on 
such factors as the choice of the host cell to be transformed and/or the type of 
polypeptide desired to be expressed. Representative examples of appropriate hosts 

20 include bacterial cells, such as gram positive, gram negative cells; fungal cells, such as 
yeast cells and Aspergillus cells; insect cells such as Drosophila S2 and Spodoplera Sf9 
cells; animal cells such as CHO, COS, HeLa, C127, 3T3, BHK, 293 and Bowes 
melanoma cells; and plant cells. 

In one embodiment, cells used to express heterologous polypeptides of the 

25 invention, comprise a mutation which renders one or more endogenous transcription 
factors, such as a AraC family polypeptide or a MarA family polypeptide, nonfunctional 
or causes one or more endogenous polypeptide to not be expressed. Manipulation of the 
genetic background in this maimer allows for screening for compounds that modulate 
specific transcription factors, such as MarA family members or AraC family members, 

30 or more than one transcription factors. 

In other embodiments, mutations may also be made in other related genes of the 
host cell, such that there will be no interference from the endogenous host loci. In yet 
another embodiment, a mutation may be made in a chromosomal gene to create a 
heterotroph. 

35 Introduction of a nucleic acid molecule into the host cell ("transformation*') can 

be effected by methods described in many standard laboratory manuals, such as Davis et 
al., Basic Methods In Molecular Biology, (1986) and Sambrook et al., Molecular 
Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory Press, Cold 
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Spring Harbor, N.Y. (1989). Examples include calcium phosphate transfection, DEAE- 
dextran mediated transfection, transvection, microinjection, cationic lipid-mediated 
transfection, electroporation, transduction, scrape loading, ballistic introduction and 
infection. 

5 Purification of polypeptides, e.g., recombinant^ expressed polypeptides, can be 

accomplished using techniques known in the art. For example, if the polypeptide is 
expressed in a form that is secreted from cells, the medium can be collected. 
Alternatively, if the polypeptide is expressed in a form that is retained by cells, the host 
cells can be lysed to release the polypeptide. Such spent medium or cell lysate can be 

1 0 used to concentrate and purify the polypeptide. For example, the medium or lysate can 
be passed over a column, e.g., a column to which antibodies specific for the polypeptide 
have been bound. Alternatively, such antibodies can be specific for a second 
polypeptide which has been fused to the first polypeptide {e.g., as a tag) to facilitate 
purification of the first polypeptide. Other means of purifying polypeptides are known in 

15 the art. 

TV. Methods for Identifying Antiinfective Compounds Which Modulate an Activity of a 
Transcription Factor 

Transcription factor agonists and antagonists can be assayed in a variety 

20 of ways. For example, in one embodiment, the invention provides for methods for 

identifying a compound which modulates an transcription factor, e.g., by measuring the 
ability of the compound to interact with an transcription factor nucleic acid molecule or 
an transcription factor polypeptide or the ability of a compound to modulate the activity 
or expression of an transcription factor polypeptide. Furthermore, the ability of a 

25 compound to modulate the binding of an transcription factor polypeptide or transcription 
factor nucleic acid molecule to a molecule to which they normally bind, e.g., a nucleic 
acid or protein molecule can be tested. 

In one embodiment, a transcription factor and its cognate DNA sequence 
can be present in a cell free system, e.g., a cell lysate and the effect of the compound on 

30 that interaction can be measured using techniques known in the art. 

In a preferred embodiment, the assay system is a cell-based system. 
Compounds identified using the subject methods are useful, e.g., to interfere with the 
ability of a microbe to grow in a host and/or in reducing microbial virulence and, 
thereby, and in reducing the ability of the microbe to cause infection in a host. 

35 The ability of the test compound to modulate the expression and/or 

activity of a transcription factor can be determined in a variety of ways. Exemplary 
methods which can be used in the instant assays are known in the art and are described, 
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e.g., in 5,817,793 and WO 99/61579. Other exemplary methods are described in more detail 
below. 

In one embodiment, the invention provides for methods of identifying a 
test compound which modulates the expression and/or activity of a transcription factor, 
5 (e.g., an HTH protein, a MarA family polypeptide, an AraC family polypeptide, etc.) by 
contacting a cell expressing a transcription factor (or portion thereof) with a test 
compound under conditions which allow interaction of the test compound with the cell. 

Assays 

10 In one embodiment, the expression of a selectable marker that confers a 

selective growth disadvantage or lethality is placed under the direct control of a MarA. 
responsive element in a cell expressing marA. 

In one embodiment, marA is plasmid encoded. In one embodiment, the 
genetic background of the host organism is manipulated, e.g., to delete one or more 

15 chromosomal marA genes or marA homolog genes. 

In one embodiment, expression of marA is controlled by a highly 
regulated and inducible promoter. For example, in one embodiment, a promoter selected 
from the group consisting of trp, tac, or tet in bacterial cells or GAL1 in yeast cells can 
be used. 

20 In another embodiment, expression of marA is constitutive. 

In one embodiment, a selective marker is a cytotoxic gene product (e.g., 

ccdB). 

In another embodiment, a selective marker is a gene that confers 
antibiotic resistance (e.g., kan, cat, or bla). 

25 In another embodiment, a selective marker is an essential gene (e.g., purA 

or guaB in a purine or guanine heterotroph). 

In still another embodiment, a selective marker is a gene that confers a 
selective growth disadvantage in the presence of a particular metabolic substrate (e.g., 
the expression of URA3 in the presence of 5-fluoroorotic acid [5-FOA] in yeast). 

30 In one embodiment, compounds that modulate transcription factors (e.g., 

HTH proteins, AraC family polypeptides, or MarA family polypeptides) are identified 
using a one-hybrid screening assay. As used herein, the term "one-hybrid screen' 5 as 
used herein includes assays that detect the disruption of protein-nucleic acid interactions. 
These assays will identify agents that interfere with the binding of a transcription factor 

35 (e.g., an HTH protein, a AraC family polypeptide, or a MarA family polypeptide) to a 
particular target, e.g., DNA containing, for MarA, a marbox, at the level of the target 
itself, e.g, by binding to the target and preventing the trnscriptional activation factor 
from interacting with or binding to this site. 
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In another embodiment, compounds of the invention are identified using a 
two-hybrid screening assay. As used herein the term "two-hybrid screen" as used herein 
includes assays that detect the disruption of protein-protein interactions. Such two 
hybrid assays can be used to interfere with crucial protein-transcription factor 
5 interactions (e.g., HTH protein interactions, AraC family polypeptide interactions, MarA 
family polypeptide interactions). One example would be to prevent RNA polymerase- 
MarA family polypeptide contacts, that are necessary for the MarA family polypeptide to 
function as a transcription factor (either positive acting or negative acting). i 
In yet another embodiment, compounds of the invention are identified 

10 using a three-hybrid screening assay. As used herein the term "three-hybrid screen" as 
used herein includes assays that will detect the disruption of a signal transduction 
pathways) required for the activation of a particular regulon of interest. In one 1 
embodiment, the three-hybrid screen is used to detect disruption of a signal transduction 
pathway(s) required for the activation of the Mar regulon, i.e., synthesis of MarA. (Li 

15 and Park. J, Bact 181 :4824). The assay can be used to identify compounds that may be 
responsible for activating transcription factor expression, e.g., Mar induction by 
antibiotics may proceed in this manner. 

In one embodiment of the assay, the expression of a selective marker I 
(e.g., ccdB, cat, bla, kan, guaB, URA3) is put under the direct control of an activatable ! 

20 MarA responsive activatable promoter (e.g., inaA, galT, micF). In the absence of Mar ! 
A, the expression of the selective marker would be silent. For example, in the case of 1 
regulation of the cytotoxic gene ccdB, the gene would be silent and the cells would 
survive. Synthesis of MarA from an inducible plasmid in a suitable host would result in 
the activation of the MarA responsive activatable promoter and expression of the J 

25 selective marker. In the case of ccdB, the gene would be expressed and result in cell 
death. Compounds that inhibit MarA would be identified as those that permit cell 
survival under conditions of MarA expression. 

In another embodiment, e.g., where the expression of the MarA 
responsive activatable promoter regulates a gene such as XJRA3, a different result could j 

30 be obtained. In this case, in the absence of MarA and thus, in the absence of URA3 
expression, cells would grow in the presence of a 5-FOA. Upon activation of MarA 
expression and thus synthesis of URA3, cells would die following the conversion of 5- 
FOA to a toxic metabolite by URA3. i 
In another embodiment, a selectable marker is put under the direct control 

35 of a repressible MarA responsive promoter (e.g. , fecA). In this example, under 

conditions of constitutive MarA synthesis, e.g., in a constitutive mar (marc) mutant the 
expression of the selectable marker would be silent. In the case of ccdB, this would 
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mean that cells would remain viable. Following inactivation of Mar A, the selectable 
marker would be turned on, resulting in cell death. 

In another embodiment, a purine or guanine heterotroph can be 
constructed by the inactivation of the chromosomal guaB or pur A genes in E. colu The 

5 guaB or pur A gene would then be cloned into a suitable vector, under the control of its 
natural promoter. This construct would then be transformed into the heterotrophic host. 
The heterotroph will not grow if MarA expression is constitutive and if cells are grown 
on media lacking purines or guanine. This can be attributed to MarA mediated 
repression of guaB or purA synthesis. Candidate inhibiting compounds of MarA can be 

10 identified as compounds that restored growth, i.e., relieved MarA mediated repression of 
guaB and purA expression. In another embodiment, genes that are required for growth 
in vivo, for example in an animal model of infection. 

In preferred embodiments, controls may be included to ensure that any 
compounds which are identified using the subject assays do not merely appear to 

1 5 modulate the activity of a transcription factor, because they inhibit protein synthesis. For 
example, if a compound appears to inhibit the synthesis of a protein being translated 
from RNA which is transcribed upon activation of a MarA family responsive element, it 
may be desirable to show that the synthesis of a control, e.g., a protein which is being 
translated from RNA which is not transcribed upon activation of a MarA family 

20 responsive element, is not affected by the addition of the same compound. For example, 
the amount of the MarA family polypeptide being made and compared to the amount of 
an endogenous protein being made. In another embodiment the microbe could be 
transformed with another plasmid comprising a promoter which is not a MarA family 
responsive promoter and a protein operably linked to that promoter. The expression of 

25 the control protein could be used to normalize the amount of protein produced in the 
presence and absence of compound. 

V. Microbes Suitable For Testing 

Numerous different microbes are suitable for testing in the instant assays. 
30 As such, they may be used as intact cells or as sources of material, eg., nucleic acid 
molecules or polypeptides as described herein. 

In preferred embodiments, microbes for use in the claimed methods are 
bacteria, either Gram negative or Gram positive bacteria. More specifically, any bacteria 
that are shown to become resistant to antibiotics , e.g., to display a Mar phenotype are 
35 preferred for use in the claimed methods, or that are infectious or potentially infectious. 

Examples of microbes suitable for testing include, but are not limited to, 
Pseudomonas aeruginosa, Pseudomonas fluorescens, Pseudomonas acidovorans 9 
Pseudomonas alcaligenes, Pseudomonas putida, Stenotrophomonas maltophilia, 
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Burkliolderia cepacia, Aeromonas hydrophilia, Escherichia coli, Citrobacter jreundii, 
Salmonella typhimurium, Salmonella typhi. Salmonella paratyphi, Salmonella 
enteritidis, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, Enterobacter 
cloacae, Enterobacter aerogenes, Klebsiella pneumoniae, Klebsiella oxytoca, Serratia 
marcescens, Francisella tularensis, Morganella morganii, Proteus mirabilis, Proteus 
vulgaris, Providencia alcalifaciens, Providencia rettgeri, Providencia stuartii, 
Acinetobacter calcoaceticus, Acinetobacter haemolyticus, Yersinia enterocolitica, 
Yersinia pestis, Yersinia pseudotuberculosis, Yersinia intermedia, Bordetella pertussis, 
Bordetella parapertussis, Bordetella bronchiseptica, Haemophilus influenzae, 
Haemophilus parainfluenzae, Haemophilus haemolyticus, Haemophilus 
parahaemolyticus, Haemophilus ducreyi, Pasteurella multocida, Pasteurella 
haemolytica, Branhamella catarrhalis, Helicobacter pylori, Campylobacter fetus, 
Campylobacter jejuni, Campylobacter coli, Borrelia burgdorferi, Vibrio cholerae, 
Yibrio parahaemolyticus, Legionella pneumophila, Listeria monocytogenes, Neisseria 
gonorrhoeae, Neisseria meningitidis, Gardnerella vaginalis, Bacteroides fragilis, 
Bacteroides distasonis, Bacteroides 3452A homology group, Bacteroides vulgatus, 
Bacteroides ovalus, Bacteroides thetaiotaomicron, Bacteroides uniformis, Bacteroides 
eggerthii, Bacteroides splanchnicus, Clostridium difficile, Mycobacterium tuberculosis, 
Mycobacterium avium, Mycobacterium intracellulare, Mycobacterium leprae, 
Corynebacterium diphtheriae, Corynebacterium ulcerans, Streptococcus pneumoniae, 
Streptococcus agalactiae, Streptococcus pyogenes, Enterococcus faecalis, Enterococcus 
faecium, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus 
saprophytics, Staphylococcus intermedius, Staphylococcus hyicus subsp. hyicus, 
Staphylococcus haemolyticus, Staphylococcus hominis, and Staphylococcus 
saccharolyticus. 

In one embodiment, microbes suitable for testing are bacteria from the 
family Enterobacteriaceae. In preferred embodiments, the compound is effective 
against a bacteria of a genus selected from the group consisting of: Escherichia, Proteus, 
Salmonella, Klebsiella, Providencia, Enterobacter, Burkholderia, Pseudomonas, 
Aeromonas, Haemophilus, Yersinia, Neisseria, and Mycobacteria. 

In yet other embodiments, the microbes to be tested are Gram positive 
bacteria and are from a genus selected from the group consisting of: Lactobacillus, 
Azorhizobium, Streptomyces, Pediococcus, Photobacterium, Bacillus, Enterococcus, 
Staphylococcus, Clostridium, and Streptococcus. 

In other embodiments, the microbes to be tested are fungi. In a preferred 
embodiment the fungus is from the genus Mucor or Candida, e.g., Mucor racmeosus or 
Candida albicans. 
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In yet other embodiments, the microbes to be tested are protozoa. In a 
preferred embodiment the microbe is a malaria or Cryptosporidium parasite. 

VI. Transcription factor Modulating Compounds and Test Compounds 
5 Compounds for testing in the instant methods can be derived from a 

variety of different sources and can be known or can be novel. In one embodiment, 
libraries of compounds are tested in the instant methods to identify transcriptional 
activation factor modulating compounds, e.g., HTH protein modulating compounds, 
AraC family polypeptide modulating compounds, MarA family polypeptide modulating 

10 compounds, etc. In another embodiment, known compounds are tested in the instant 
methods to identify transcription factor modulating compounds (such as, for example, 
HTH protein modulating compounds, AraC family polypeptide modulating compounds, 
MarA family polypeptide modulating compounds, etc.). hi an embodiment, compounds 
among the list of compounds generally regarded as safe (GRAS) by the Environmental 

15 Protection Agency are tested in the instant methods. In another embodiment, the 
transcription factors which are modulated by the modulating compounds are of 
prokaryotic microbes. 

A recent trend in medicinal chemistry includes the production of mixtures 
of compounds, referred to as libraries. While the use of libraries of peptides is well 

20 established in the art, new techniques have been developed which have allowed the 

production of mixtures of other compounds, such as benzodiazepines (Bunin et al. 1992. 
J. Am. Chem. Soc. 114:10987; DeWitt et al. 1993. Proc. Natl. Acad. Sci. USA 90:6909) 
peptoids (Zuckermann. 1994. J. Med. Chem. 37:2678) oligocarbamates (Cho et al. 
1993. Science. 261:1303), and hydantoins (DeWitt etal. supra). Rebek et al. have 

25 described an approach for the synthesis of molecular libraries of small organic molecules 
with a diversity of 104-105 (Carell et al. 1994. Angew. Chem. Int. Ed. Engl 33:2059; 
Carell et al. Angew. Chem. Int. Ed. Engl. 1994. 33:2061). 

The compounds of the present invention can be obtained using any of the 
numerous approaches in combinatorial library methods known in the art, including: 

30 biological libraries; spatially addressable parallel solid phase or solution phase libraries, 
synthetic library methods requiring deconvolution, the f one-bead one-compound 1 library 
method, and synthetic library methods using affinity chromatography selection. The 
biological library approach is limited to peptide libraries, while the other four 
approaches are applicable to peptide, non-peptide oligomer or small molecule libraries 

35 of compounds (Lam, K.S. Anticancer Drug Des. 1997. 12:145). 

Exemplary compounds which can be screened for activity include, but are 
not limited to, peptides, nucleic acids, carbohydrates, small organic molecules, and 
natural product extract libraries, hi one embodiment, the test compound is a peptide or 
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peptidomimetic. In another, preferred embodiment, the compounds are small, organic 
non-peptidic compounds. 

Other exemplary methods for the synthesis of molecular libraries can be 
found in the art, for example in: Erb et al. 1994. Proc. Natl. Acad. Sci. USA 91 :1 1422; 
5 Horwell et al. 1996 Jmmunopharmacology 33:68; and in Gallop et al. 1994. J. Med. 
Chem. 37:1233. 

Libraries of compounds may be presented in solution (e.g., Houghten 
(1992) Biotechniques 1 3:412-421), or on beads (Lam (1991) Nature 354:82-84), chips 
(Fodor (1993) Nature 364:555-556), bacteria (Ladner USP 5,223,409), spores (Ladner 

10 USP '409), plasmids (Cull fet al. (1992) Proc Natl Acad Sci USA 89:1865-1869) or on 
phage (Scott and Smith (1990) Science 249:386-390); (Devlin (1990) Science 249:404- 
406); (Cwirla et al. (1990) Proc. Natl Acad Sci. 87:6378-6382); (Felici (1991) J. Mol 
Biol 222:301-310); (Ladner supra.). Other types of peptide libraries may also be 
expressed, see, for example, U.S. Patents 5,270,181 and 5,292,646). In still another 

15 embodiment, combinatorial polypeptides can be produced from a cDNA library. 

In other embodiments, the compounds can be nucleic acid molecules. In 
preferred embodiments, nucleic acid molecules for testing are small oligonucleotides. 
Such oligonucleotides can be randomly generated libraries of oligonucleotides or can be 
specifically designed to reduce the activity of a transcription factor, e.g., a HTH protein, 

20 a MarA family polypeptide, or an AraC family polypeptide. For example, in one 
embodiment, these oligonucleotides are sense or antisense oligonucleotides. In an 
embodiments, oligonucleotides for testing are sense to the binding site of a particular 
transcription factor, e.g., a MarA family polypeptide helix-turn-helix domain. Methods 
of designing such oligonucleotides given the sequences of a particular transcription 

25 factor polypeptide, such as a MarA family polypeptide, is within the skill of the art. 

In yet another embodiment, computer programs can be used to identify 
individual compounds or classes of compounds with an increased likelihood of 
modulating a transcription factor activity, e.g., an HTH protein, a AraC family 
polypeptide, or a MarA family polypeptide activity. Such programs can screen for 

30 compounds with the proper molecular and chemical complementarities with a chosen 
transcription factor. In this manner, the efficiency of screening for transcription factor 
modulating compounds in the assays described above can be enhanced. 

VJL Computer Modeling Techniques for Identifying Transcription factor Modulating 
35 Compounds 

The invention also pertains to the use of molecular design techniques to 
design transcription factor modulating compounds, e.g., HTH protein modulating 
compounds, AraC family modulating compounds, MarA family modulating compounds, 
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or MarA modulating compounds, which are capable of binding or interacting with one or 
more transcription factors (e.g., of a prokaryotic or eukaryotic organism). The invention 
pertains to both the transcription factor modulating compounds identified by the methods 
as well as the modeling methods, and compositions comprising the compounds 

5 identified by the methods. 

In an embodiment, the invention pertains to a method of identifying 
transcription factor modulating compounds. The method includes obtaining the structure 
of a transcription factor of interest, and using GLIDE to identify a scaffold which has an 
interaction energy score of -20 or less (e.g., -40 or less, e.g., -60 or less) with a portion 

10 of the transcription factor. 

The invention pertains, at least in part, to a computational screening of small 
molecule databases for chemical entities or compounds that can bind in whole, or in part, 
to a transcription factor, such as a HTH protein, an AraC family polypeptide, a MarA 
family polypeptide, e.g., MarA. In this screening, the quality of fit of such entities or 

1 5 compounds to the binding site may be judged either by shape complementarity or by 
estimated interaction energy (Meng, E. C. et aL, 1992, J. Coma. Chem., 13:505-524). 
Such a procedure allows for the screening of a very large library of potential 
transcription factor modulating compounds for the proper molecular and chemical 
complementarities with a selected protein or class or proteins. Transcription factor 

20 modulating compounds identified through computational screening can later be passed 
through the in vivo assays described herein as further screens. For example, a MarA 
inhibiting compound identified through computational screening could be tested for its 
ability to promote cell survival in a cell system containing a counterselectable marker 
under the control a MarA activated promoter. The promotion of cell survival in the 

25 foregoing assay would be indicative of a compound that inhibits MarA's activity as a 
transcriptional activator. Other suitable assays are described in the Examples and 
through the specification. 

The crystal structures of both MarA (PDB ID code 1BL0) and its homolog Rob 
(PDB ID code 1DY5) are available in the Protein Data Bank (http://www.rcsb.org/pdbA . 

30 These structures were used to identify sites on the proteins that could be targeted by 
small molecule chemical inhibiting compounds. A total of at least eight potential small 
molecule binding sites on MarA (Table 2) and four sites on Rob (Table 3) were 
identified as potential small molecule binding sites. The invention pertains, at least in 
part, to MarA modulating compounds which interact with any one of the following sites 

35 of MarA (based on the sequence given in SEQ ID NO. 2). 
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Table 2 



Site Number 


Residues (based on full length MarA) 


Site Label 


1 


42 to 50 


R46 Major Groove 


2 


54 to 62 


T C£. T'I'I'TT ^ 

L56 HTH core 


3 


« f/v A<\ 
J J tO 03 


Koi Minor ijrroove 


4 


15 to 25 


W19 


5 


14 to 25 


E21 


6 


24 to 35 


L28 


7 


76 to 83 


P78 


8 


106 to 112 


R110 



The GLIDE docking method was then used to fit combinatorial chemistry 
scaffolds into these sites and an interaction energy was calculated for each. Eight 
5 scaffolds were predicted to bind to site 1, encompassing amino acids tryptophan 42 to 
lysine 50, with an interaction energy score of --60 or less. These scaffolds are shown 
below: 





10 Three scaffolds were identified for site 2 of MarA (e.g., residues histidine 54 to 

serine 62). 
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Four scaffolds were identified for MarA site 3, (e.g., residues serine 55 to 



methionine 65): 




' Six scaffolds were identified for site 6 (e.g., residues leucine 24 to glutamate 35). 




5 These scaffolds were then used to search the CambridgeSoft ACX-SC database 

of over 600,000 non-proprietary chemical structures and the number of chemicals similar 
to the scaffolds was determined. 

The term "scaffold" includes the compounds identified by the computer 
modeling program. These compounds may or may not be themselves transcription factor 

10 modulating compounds. An ordinarily skilled artisan will be able to analyze a scaffold 
obtained from the computer modeling program and modify the scaffold such that the 
resulting compounds have enhanced chemical properties over the initial scaffold 
compound, e.g., are more stable for administration, less toxic, have enhanced affinity for 
a particular transcription factor, etc. The invention pertains not only to the scaffolds 

15 identified, but also the transcription factor modulating compounds which are developed 
using the scaffolds. 

Table 3 lists portions of Rob which were identified as possible interaction sites 
for a modulating compound. The invention pertains, at least in part, to any compounds 
modeled to bind to these regions of Rob. The numbering corresponds to that given in 
20 SEQ ID NO. 4. 
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Table 3 



Site Number 


Residues (based on full length Rob) 


Site Label 


1 


F 37 to 45 


R40 Major Groove 


2 


43 to 54 


150 HTH Core 


3 


51 to 60 


R55 Minor Groove 


4 


10 to 20 


W13 



These scaffolds were identified as possible modulating compounds which with 
site 1 of Rob (residues 37-45), aMarA family polypeptide. 

5 




These scaffolds were identified as small molecules that may interact with site 2 
of Rob (residues 43-52), a MarA family polypeptide. ^ 
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The design of compounds that bind to, modulate, or inhibit transcription factors, 
generally involves consideration of two factors. First, the compound must be capable of 
physically and structurally associating with a particular transcription factor. Non- 
5 covalent molecular interactions important in the association of a transcription factor with 
a modulating compound include hydrogen bonding, van der Waals and hydrophobic 
interactions. 

Second, the modulating compound must be able to assume a conformation that 
allows it to associate with the selected transcription factor. Although certain portions of 

10 the inhibiting compound will not directly participate in this association with the 

transcription factor, those portions may still influence the overall conformation of the 
molecule. This, in turn, may have a significant impact on potency. Such conformational 
requirements include the overall three-dimensional structure and orientation of the 
chemical entity or compound in relation to all or a portion of the binding site, e.g., active 

1 5 site or accessory binding site of a particular transcription factor such as MarA, or the 
spacing between functional groups of a compound comprising several chemical entities 
that directly interact with the particular transcription factor. 

In a further embodiment, the potential modulating effect of a chemical compound 
on a selected transcription factor (e.g., a HTH protein, a AraC family polypeptide, a 

20 MarA family polypeptide, e.g., MarA) is analyzed prior to its actual synthesis and testing 
by the use of computer modeling techniques. If the theoretical structure of the given 
compound suggests insufficient interaction and association between it and the selected 
transcription factor, synthesis and testing of the compound is avoided. However, if 
computer modeling indicates a strong interaction, the molecule may then be synthesized 
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and tested for its ability to bind to the selected transcription factor and modulate the 
transcription factor's activity. 

A transcription factor modulating compound or other binding compound (e.g., an 
HTH protein modulating compound, an AraC family polypeptide modulating compound, 
5 or a MarA family inhibiting compound, e.g., a MarA inhibiting compound) may be 
computationally evaluated and designed by screening and selecting chemical entities or 
fragments for their ability to associate with the individual small molecule binding sites 
or other areas of a transcription factor. 

One skilled in the art may use one of several methods to screen chemical entities 

10 or fragments for their ability to associate with a selected transcription factor and more 
particularly with the individual small molecule binding sites of the particular 
transcription activation factor. This process may begin by visually inspecting the 
structure of the transcription factor on a computer screen based on the atomic 
coordinates of the transcription factor crystals. Selected chemical entities may then be 

15 positioned in a variety of orientations, or docked, within an individual binding site of the 
transcription factor. Docking may be performed using software such as Quanta and 
Sybyl, followed by energy minimization with standard molecular mechanics forcefields 
or dynamics with programs such as CHARMM (Brooks, B. R. et al., 1983, J. Comp. 
Chem., 4:187-217) or AMBER (Weiner, S. J. et al., 1984, J. Am. Chem. Soc, 106:765- 

20 784). 

Specialized computer programs may also assist in the process of selecting 
molecules that bind to a selected transcription factor, (e.g., an HTH protein, an AraC 
family polypeptide, or a MarA family polypeptide, e.g., MarA). The programs include, 
but are not limited to: 
25 1. GRID (Goodford, P. J., 1985, "A Computational Procedure for 

Determining Energetically Favorable Binding Sites on Biologically 
Important Macromolecules" J. Med. Chem., 28:849-857 GRID is 
available from Oxford University, Oxford, UK. 

30 2. AUTODOCK (Goodsell, D. S. and A. J. Olsen, 1990, "Automated 

Docking of Substrates to Proteins by Simulated Annealing" Proteins: 
Structure. Function, and Genetics, 8:195-202. AUTODOCK is available 
from Scripps Research Institute, La JoDa, Calif. AUTODOCK helps in 
docking inhibiting compounds to a selected transcription factor in a 

35 flexible manner using a Monte Carlo simulated annealing approach. The 

procedure enables a search without bias introduced by the researcher. 
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3. MCSS (Miranker, A. and M. Karplus, 1991, "Functionality Maps of 
Binding Sites: A Multiple Copy Simultaneous Search Method." Proteins: 
Structure, Function and Genetics, 11:29-34). MCSS is available from 
Molecular Simulations, Burlington, Mass. 

5 

4. MACCS-3D (Martin, Y. C, 1992, J. Med. Chem., 35:2145-2154) is a 3D 
database system available from MDL Information Systems, San Leandro, 
Calif. 

DOCK (Kuntz, I. D. et aL $ 1982, "A Geometric Approach to 
Macromolecule-Ligand Interactions" J. Mol Biol, 161:269-288). DOCK 
is available from University of California, San Francisco, Calif. 
DOCK is based on a description of the negative image of a space-filling 
representation of the molecule (i.e. the selected transcription factor) that 
should be filled by the inhibiting compound. DOCK includes a force-field 
for energy evaluation, limited conformational flexibility and 
consideration of hydrophobicity in the energy evaluation. 

MCDLNG (Monte Carlo De Novo Ligand Generator) (D. K. Gehlhaar, et 
al 1995. J. Med. Chem. 38:466-472). MCDLNG starts with a structure 
(i.e. an X-ray crystal structure) and fills the binding site with a close 
packed array of generic atoms. A Monte Carlo procedure is then used to 
randomly: rotate, move, change bond type, change atom type, make 
atoms appear, make bonds appear, make atoms disappear, make bonds 
disappear, etc. The energy function used by MCDLNG favors the 
formation of rings and certain bonding arrangements. Desolvation 
penalties are given for heteroatoms, but heteroatoms can benefit from 
hydrogen bonding with the binding site. 

30 In an embodiment of the invention, docking is performed by using the Affinity 

program within IhsightH (Molecular Simulations Inc., 1996, San Diego, Calif., now 
Accelrys Inc.). Affinity is a suite of programs for automatically docking a ligand (i.e. a 
transcription factor modulating compound) to a receptor (i.e. a transcription factor). 
Commands in Affinity automatically find the best binding structures of the ligand to the 

35 receptor based on the energy of the ligand/receptor complex. As described below, 
Affinity allows for the simulation of flexible-flexible docking. 



10 5. 



15 



20 



25 
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Affinity consists of two commands, GridDocking and fixedDocking, under the 
new pulldown Affinity in the Docking module of the Insight II program. Both 
commands use the same, Monte Carlo type procedure to dock a guest molecule (i.e. 
HTH protein modulating compound) to a host (i.e., a transcription factor). They also 

5 share the feature that the "bulk" of the receptor (i.e. transcription factor), defined as 
atoms not in the binding (active) site specified, is held rigid during the docking process, 
while the binding site atoms and ligand atoms are movable. The commands differ, 
however, in their treatment of nonbond interactions. In GridDocking, interactions 
between bulk and movable atoms are approximated by the very accurate and efficient 

1 0 molecular mechanical/grid (MM/Grid) method developed by Luty et al 1 995. Comp. 
Chem. 16:454, while interactions among movable atoms are treated exactly. 
GridDocking also includes the solvation method of Stouten et al 1993. Molecular 
Simulation 10:97. On the other hand, the fixedDocking command computes nonbond 
interactions using methods in the Discover program (cutoff methods and the cell 

1 5 multipole method) and it does not include any solvation terms. 

Affinity does not, generally, require any intervention from the user during the 
docking. It automatically moves the ligand (i.e. modulating compound), evaluates 
energies, and checks if the structure is acceptable. Moreover, the ligand and the binding 
site of the receptor (i.e. the selected transcription modulator) are flexible during the 

20 search. 

Most of the docking methods in the literature are based on descriptors or 
empirical rules (for a review see Kuntze/a/. 1994.^cc. Chem.Res. 27:117. These 
include DOCK (Kuntz et al. 1982. J. Mol Biol. 161:269., Shoichet et al 1992. J. 
Compt Chem. 13:380., Oshiro et al 1995. J. Comp. Aided Molec. Design 9:1 13.), 

25 CAVEAT (Bartlett et al 1989. "Chemical and Biological Problems in Molecular 
Recognition" Royal Society of Chemistry: Cambridge, pp. 182-196., Lauri & Bartlett. 
1994 J. Comput Aided Mol Design 8:51), FLOG (Miller et al 1994. J. Comp. Aided 
Molec. Design 8:153), and PRO_LIGAND (Clark et al 1995. J. Comp. Aided Molec. 
Design 9:13), to name a few. Affinity differs from these methods in several aspects. 

30 First, it uses full molecular mechanics in searching for and evaluating docked 

structures. In contrast descriptor-based methods use empirical rules which usually take 
into account only hydrogen bonding, hydrophobic interactions, and steric effects. This 
simplified description of ligand/receptor interaction is insufficient in some cases. For 
example, Meng et al 1992. J. Compt. Chem. 13:505 studied three scoring methods in 

35 evaluating docked structures generated by DOCK. They found that only the forcefield 
scores from molecular mechanics correctly identify structures closest to experimental 
binding geometry, while scoring functions that consider only steric factors or only 
electrostatic factors are less successful. Note that in the study by Meng et al 1992./. 
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Compt Chem. 13:505, docking was still performed using descriptors. Affinity, on the 
other hand, uses molecular mechanics in both docking and scoring and is therefore more 
consistent. 

Second, in Affinity, while the bulk of the receptor is fixed, the defined binding 

5 site is free to move, thereby allowing the receptor to adjust to the binding of different 
ligands or different binding modes of the same ligand. By contrast, almost all of the 
descriptor-based methods fix the entire receptor. 

Third, the ligand itself is flexible in Affinity which permits different 
conformations of a ligand (i.e. transcription factor modulating compound) to be docked 

10 to a receptor (i.e. transcription factor). Recently Oshiro et al (1995 J. Comp. Aided 
Molec. Design 9;1 13) extended DOCK to handle flexible ligands. FLOG is also able to 
treat flexible ligand by including different conformations for each structure in the 
database (Miller et al. 1995. /. Comp. Aided Molec. Design. 8:153). Most other methods 
are limited to rigid ligands. 

15 There are also a few energy based docking methods (Kuntz et al 1994. Acc. 

Chem Res. 27: 1 17). These methods use either molecular dynamics (notably simulated 
annealing) or Monte Carlo methods. For example, Caflisch et al 1992. Proteins: Struct. 
Funct and Genetics 13:223) developed a two step procedure for docking flexible 
ligands. In their procedure, ligand is first docked using a special energy function 

20 designed to remove bad contact between the ligand and the receptor efficiently. Then 
Monte Carlo minimization (Li & Scheraga. 1987. Proc. Natl Acad. Sci. U.S.A. 84:6611) 
is carried out to refine the docked structures using molecular mechanics. Hart and Read. 
1992. Proteins: Struct. Funct and Genetics 13:206 also employ two steps to dock 
ligands. They use a score function based on receptor geometry to approximately dock 

25 ligands in the first step, and then use Monte Carlo minimization similar to that of 
Caflisch et al. 1992. Proteins: Struct. Funct. and Genetics 13:223 for the second step. 
The method by Mizutani et al (1994. J. Mol Biol 243:310) is another variation of this 
two step method. 

Affinity uses a Monte Carlo procedure in docking ligands, but there are 
30 important distinctions over the prior art methods. First, the Monte Carlo procedure in 
Affinity can be used in conjunction either with energy minimization (to mimic the 
Monte Carlo minimization method of Li & Scheraga. 1987. Proc. Natl Acad. Sci. U.S.A. 
84:661 1) or with molecular dynamics (to mimic the hybrid Monte Carlo method, Clamp 
et al. 1994. J. Comput. Chem. 15:838, or the smart Monte Carlo method, Senderowitz et 
35 al. 1995. J. Am. Chem. Soc. 1 17:821 1). This flexibility allows Affinity to be applied to a 
variety of docking problems. Second, in the initial screening of docked structures, 
Affinity employs energy differences obtained from molecular mechanics, while the 
methods discussed above use empirical rules or descriptors. Therefore, Affinity is more 
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consistent in that it uses molecular mechanics in both initial screening and final 
refinement of docked structures. Third, Affinity allows the binding site of the receptor 
to relax, while the methods discussed above fix the entire receptor. Fourth, Affinity 
employs two new nonbond techniques which are both accurate and efficient to make 

5 docking practical. One is the Grid/MM method of Luty et al which represents the bulk 
of the receptor by grids (Luty et al 1995. Comp. Chem. 16:454). This method is 10-20 
times faster than the no-cutoff method with almost no loss in accuracy. It also 
incorporates the solvation method of Stouten et al (1993. Molecular Simulation 10:97). 
The other is the cell multipole method. This method is about 50% slower than the 

10 Grid/MM method, but it does not require grid setup. Thus, a typical docking calculation 
takes about 1-3 hours of CPU time on an Indigo R4400 workstation. 

Once suitable chemical fragments have been selected, they can be assembled into 
a single compound or inhibiting compound. Assembly may be proceed by visual 
inspection of the relationship of the fragments to each other on a three-dimensional 

1 5 image display on a computer screen in relation to the structure coordinates of a particular 
transcription factor, e.g., MarA. This may be followed by manual model building using 
software such as Quanta or Sybyl. 

Useful programs to aid one of skill in the art in connecting the individual 
chemical fragments include: 

20 

1 . 3D Database systems such as MACCS-3D (MDL Information Systems, 
San Leandro, Calif. This area is reviewed in Martin, Y. C, 1992, "3D 
Database Searching in Drug Design", J. Med. Chem., 35, pp. 2145-2154). 

25 2. CAVEAT (Bartlett, P. A. et al, 1989, "CAVEAT: A Program to Facilitate 

the Structure-Derived Design of Biologically Active Molecules". In 
Molecular Recognition in Chemical and Biological Problems", Special 
Pub., Royal Chem. Soc, 78, pp. 182-196). CAVEAT is available from 
the University of California, Berkeley, Calif. CAVEAT suggests 

30 inhibiting compounds to MarA based on desired bond vectors. 

3. HOOK (available from Molecular Simulations, Burlington, Mass.). 
HOOK proposes docking sites by using multiple copies of functional 
groups in simultaneous searches. 



35 



In another embodiment, transcription factor modulating compounds may be 
designed as a whole or "de novo" using either an empty active site or optionally 
including some portion(s) of a known inhibiting compound(s). These methods include: 
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1 . LUDI (Bohm, EL-J., "The Computer Program LUDI: A New Method for 
the De Novo Design of Enzyme Inhibiting compounds", J. ComJL Aid. 
Molec. Design, 6, pp. 61-78 (1992)). LUDI is available from Biosym 

5 Technologies, San Diego, Calif. LUDI is a program based on fragments 

rather than on descriptors. LUDI proposes somewhat larger fragments to 
match with the interaction sites of a macromolecule and scores its hits 
based on geometric criteria taken from the Cambridge Structural 
Database (CSD), the Protein Data Bank (PDB) and on criteria based on 

1 o binding data. LUDI is a library based method for docking fragments onto 

a binding site. Fragments are aligned with 4 directional interaction sites 
(lipophilic-aliphatic, lipophilic-aromatic, hydrogen donor, and hydrogen 
acceptor) and scored for their degree of overlap. Fragments are then 
connected (i.e. a linker of the proper length is attached to each terminal 

1 5 atom in the fragments). Note that conformational flexibility can be 

accounted for only by including multiple conformations of a particular 
fragment in the library. 

2. LEGEND (Nishibata, Y. and A. Itai, Tetrahedron, 47, p. 8985 (1991)). 
20 LEGEND is available from Molecular Simulations, Burlington, Mass. 

3. CoMFA (Conformational Molecular Field Analysis) (J. J. Kaminski. 
1994. Adv. Drug Delivery Reviews 14:331-337.) CoMFA defines 3- 
dimensional molecular shape descriptors to represent properties such as 

25 hydrophobic regions, sterics, and electrostatics. Compounds from a 

database are then overlaid on the 3D pharmacophore model and rated for 
their degree of overlap. Small molecule databased that be searched 
include: ACD (over 1,000,000 compounds), Maybridge (about 500,000 
compounds), NCI (about 500,000 compounds), and CCSD. In measuring 

30 the goodness of the fit, molecules can either be fit to the 3D molecular 

shape descriptors or to the active conformation of a known inhibiting 
compound. 



35 



4. LeapFrog (available from Tripos Associates, St. Louis, Mo.). 

FlexX (© 1993-2002 GMD German National Research Center for 
Information Technology; Rarey, M. et al J. Mol Biol y 261 :407-489) is a fast, flexible 
docking method that uses an incremental construction algorithm to place ligands into 
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and active site of the transcription factor. Ligands (e.g., transcription factor modulating 
compounds) that are capable of "fitting" into the active site are then scored according to 
any number of available scoring schemes to determine the quality of the complimentarity 
between the active site and ligand. 
5 Other molecular modeling techniques may also be employed in accordance with 

this invention. See, e.g., Cohen, N. C. et al., "Molecular Modeling Software and 
Methods for Medicinal Chemistry, J. Med. Chem., 33, pp. 883-894 (1990). See also, 
Navia, M. A. and M. A. Murcko, "The Use of Structural Information in Drug Design", 
Current Opinions in Structural Biology, 2, pp. 202-210 (1992). 

10 Candidate transcription factor modulating compounds can be evaluated for their 

modulating, e.g., inhibitory, activity using conventional techniques which may involve 
determining the location and binding proximity of a given moiety, the occupied space of 
a bound inhibiting compound, the deformation energy of binding of a given compound 
and electrostatic interaction energies. Examples of conventional techniques useful in the 

15 above evaluations include, but are not limited to, quantum mechanics, molecular 

dynamics, Monte Carlo sampling, systematic searches and distance geometry methods 
(Marshall, G. R., 1987, Ann. Ref. Pharmacol. Toxicol., 27:193). Examples of computer 
programs for such uses include, but are not limited to, Gaussian 92, revision E2 
(Gaussian, Inc. Pittsburgh, Pennsylvania), AMBER version 4.0 (University of California, 

20 San Francisco), QUANTA/CHARMM (Molecular Simulations, Inc., Burlington, Mass.), 
and Insight D/Discover (Biosym Technologies Inc., San Diego, Calif.). These programs 
may be implemented, for example, using a Silicon Graphics Indigo2 workstation or IBM 
RISC/6000 workstation model 550. Other hardware systems and software packages will 
be known and of evident applicability to those skilled in the art. 

25 Once a compound has been designed and selected by the above methods, the 

efficiency with which that compound may bind to a particular transcription factor may 
be tested and optimized by computational evaluation. An effective transcription factor 
modulating compound should demonstrate a relatively small difference in energy 
between its bound and free states (i.e., a small deformation energy of binding). 

30 Transcription factor modulating compounds may interact with the selected transcription 
factor in more than one conformation that is similar in overall binding energy. In those 
cases, the deformation energy of binding may be taken to be the difference between the 
energy of the free compound and the average energy of the conformations observed 
when the inhibiting compound binds to the enzyme. 

35 A compound designed or selected as interacting with a selected transcription 

factor, e.g., a MarA family polypeptide, e.g., MarA, may be further computationally 
optimized so that in its bound state it would preferably lack repulsive electrostatic 
interaction with the target enzyme. Such non-complementary (e.g., electrostatic) 
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interactions include repulsive charge-charge, dipokrdipole and charge-dipple 
interactions. Specifically, the sum of all electrostatic interactions between the 
modulating compound and the enzyme when the modulating compound is bound to the 
selected transcription factor, preferably make a neutral or favorable contribution to the 

5 enthalpy of binding. 

Specific computer software is available in the art to evaluate compound 
deformation energy and electrostatic interaction. Examples of programs designed for 
such uses include: Gaussian 92, revision C [M. J. Frisch, Gaussian, Inc., Pittsburgh, Pa. 
©1992]; AMBER, version 4.0 [P. A. Kollman, University of California at San 

10 Francisco, ©1994]; QUANTA/CHARMM [Molecular Simulations, Inc., Burlington, 
Mass. ©1994]; and Insight IMDiscover (Biosysm Technologies Inc., San Diego, Calif. 
©1994). These programs may be implemented, for instance, using a Silicon Graphics 
workstation, IRIS 4D/35 or IBM RISC/6000 workstation model 550. Other hardware 
systems and software packages will be known to those skilled in the art. 

15 Once a transcription factor modulating compound has been optimally selected or 

designed, as described above, substitutions may then be made in some of its atoms or 
side groups in order to improve or modify its binding properties. Initial substitutions are 
preferable conservative, i.e., the replacement group will have approximately the same 
size, shape, hydrophobicity and charge as the original group. Substitutions known in the 

20 art to alter conformation should be avoided. Such substituted chemical compounds may 
then be analyzed for efficiency of fit to the selected transcription factor by the same 
computer methods described above. 

Computer programs can be used to identify unoccupied (aqueous) space between 
the van der Waals surface of a compound and the surface defined by residues in the 

25 binding site. These gaps in atom-atom contact represent volume that could be occupied 
by new functional groups on a modified version of the lead compound. More efficient 
use of the unoccupied space in the binding site could lead to a stronger binding 
compound if the overall energy of such a change is favorable. A region of the binding 
pocket which has unoccupied volume large enough to accommodate the volume of a 

30 group equal to or larger than a covalently bonded carbon atom can be identified as a 
promising position for functional group substitution. Functional group substitution at 
this region can constitute substituting something other than a carbon atom, such as 
oxygen. If the volume is large enough to accommodate a group larger than a carbon 
atom, a different functional group which would have a high likelihood of interacting 

35 with protein residues in this region may be chosen. Features which contribute to 

interaction with protein residues and identification of promising substitutions include 
hydrophobicity, size, rigidity and polarity. The combination of docking, Ki estimation, 
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and visual representation of sterically allowed room for improvement permits prediction 
of potent derivatives. 

Similarity Screening 

5 Once a transcription factor modulating compound has been selected or 

designed, computational methods to assess its overall likeness or similarity to other 
molecules can be used to search for additional compounds with similar biochemical 
behavior. Ih such a way, for instance, HTS derived hits can be tested to assure that they 
are bona fide ligands against a particular active site, and to eliminate the possibility that 

10 a particular hit is an artifact of the screening process. There are currently several 

methods and approaches to determine a particular compound's similarity to members of 
a virtual database of compounds. One example is the OPTISIM methodology that is 
distributed in the Tripos package, SYBYL (© 1991-2002 Tripos, Inc., St. Louis, MO). 
OPTISIM exploits the fact that each 3-dimensional representation of a molecule can be 

15 broken down into a set of 2-dimensional fragments and encoded into a pre-defined 

binary string. The result is that each compound within a particular set is represented by a 
unique numerical code or fingerprint that is amenable to mathematical manipulations 
such as sorting and comparison. OPTISIM is automated to calculate and report the 
percent difference in the fingerprints of the respective compounds for instance according 

20 to the using a formalism known as the Tanimoto coefficient. For instance, a compound 
that is similar in structure to another will share a high coefficient Large virtual 
databases of commercially available compounds or of hypothetical compounds can be 
quickly screened to identify compounds with high Tanimoto coefficient. 

25 CoMFA/OSAR 

Once a series of similar transcription factor modulating compounds has 
been identified and expanded by the methods described, their experimentally determined 
biological activities can be correlated with their structural features using a number of 
available statistical packages. In a typical project within the industry, the CoMFA 

30 (Comparative Molecular Field Analysis) and QSAR (Quantitative Structure Activity 
Relationship) packages within the SYBYL suite of programs (Tripos Associates, St. 
Louis, MO) are utilized. In CoMFA, a particular series of compounds with measured 
activities are co-aligned in a manner that is believed to emulate their arrangement as they 
interact with the active site. A 3-dimensional lattice, or grid is then constructed to 

35 encompass the collection of the so-aligned compounds. At each point on the lattice, an 
evaluation of the potential energy is determined and tabulated-typically potentials that 
simulate the electronic and steric fields are determined, but other potential functions are 
available. Using the statistical methods such as PLS (Partial Least Squares), correlation 
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between the measured activities and the potential energy values at the grid-points can be 
determined and summed in a linear equation to produce the overall molecular correlation 
or QSAR model. A particularly useful feature in CoMFA is that the individual 
contribution for each grid-point is known; me importance of the grid points upon the 

5 overall correlation can be visualized graphically in what is referred to as a CoMFA field. 
When this field is combined with the original compound alignment, it becomes a 
powerful tool to rationalize the activities of the individual compounds from whence the 
model was derived, and to predict how chemical modification of a reference compound 
would be effected. As an example, a QSAR model was developed for a set of 92 

10 benzodiazepines using the method described above. A representative CoMFA field is 
shown in Figure 4; the region delineated by wire mesh (adjacent to the referenced 
triazinoxazepine) is the region where chemical modification characterized by increasing 
steric bulk would lead to favorable effects in transcription factor modulation. 

The invention pertains, per se, to not only the methods for identifying the 

1 5 transcription factor modulating compounds, but to the compounds identified by the 
methods of the invention as well as methods for using the identified compounds. 

VIII. MarA family Modulating Compounds, and Methods of Use thereof 

In an embodiment, the invention pertains to methods for modulating a 
20 transcription factor, e.g., an HTH protein, an AraC family polypeptide, or a MarA family 
polypeptide. The method includes contacting the transcription factor, e.g., a MarA 
family polypeptide, with a transcription factor modulutating compound of the formula 

CD: 

25 A-E CD 

wherein A is a polar moiety, E is a hydrophobic moiety, and 
pharmaceutically acceptable salts thereof. The transcription factor modulating 
compound, e.g., a MarA family modulating compound, may comprise one or more polar 

30 moieties and/or one or more hydrophobic moieties. 

In another embodiment, the invention pertains to methods for reducing 
antibiotic resistance of a microbial cell. The method includes contacting the cell with a 
transcription factor modulating compound, e.g., a MarA family modulating compound, 
such that the antibiotic resistance of the cell is reduced. 

35 jn another embodiment, the invention pertains to inhibiting transcription, 

comprising contacting a transcription factor with a transcription factor modulating 
compound, such that transcription is inhibited. In a further embodiment, the 
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transcription of a prokaryotic cell is inhibited In another further embodiment, the 
transcription factor modulating compound is a compound of anyone of formulae (T)-(X). 

The term "antibiotic resistance" includes resistance of a microbial cell to 
a antibiotic compound, especially an antibiotic compound which had been previously 
5 used to treat similar microbial organisms successfully. 

The term "polar moiety" includes moieties with at least one heterocycle. 
It also includes moieties such as, but not limited to, hydroxyl, halogens, thioethers, 
carboxylic acids, metals (e.g. alkali, alkaline, Au, Hg, Ag, Mn, Co, Cu, Zn, etc.), nitro, 
amino, alkoxy, and other moieties which allow the compound to perform its intended 
10 function. The term "polar moiety" includes moieties which allow the transcription factor 
modulating compound to perform its intended function, e.g., modulate a transcription 
factor, e.g., an AraC family polypeptide or a MarA family polypeptide. A heterocyclic 
polar moiety may comprise one or more rings, one or more of which maybe aromatic. 
In an embodiment, one or more rings of the polar moiety are fused. The heterocyclic 
1 5 polar moiety may also be bicyclic. 

The heterocyclic polar moiety may comprise one or more nitrogen, sulfur, 
or oxygen atoms. Examples of heterocycles include benzodioxazole, benzofaran, 
benzoimidazole, benzoxazole, benzothiazole, benzothiophene, chromenone, 
deazapurine, furan, imidazole, imidazopyridine, indole, indolizine, isooxazole, 
20 isothiaozole, isoquinoline, methylenedioxyphenyl, napthridine, oxazole, purine, 
pyrazine, pyrazole, pyridazine, pyridine, pyrimidine, pyrrole, pyrollidine, quinoline, 
tetrazole, thiazole, thiophene, triazole, and triazoletetrazole. 

Furthermore, the polar moiety may be substituted when chemically 
feasible. For example, the polar moiety may be substituted with one or more 
25 substituents such as alkyl, alkenyl, alkynyl, halogen, hydroxyl, alkylcarbonyloxy, 

arylcarbonyloxy, alkoxycarbonyloxy, aryloxycarbonyloxy, carboxylate, alkylcarbonyl, 
arylcarbonyl, alkoxycarbonyl, aminocarbonyl, alkylaminocarbonyl, 
dialkylaminocarbonyl, alkylthiocarbonyl, alkoxyl, phosphate, phosphonato, phosphinato, 
cyano, amino (including alkyl amino, dialkylamino, arylamino, diarylamino, and 
30 alkylarylamino), acylamino (including alkylcarbonylamino, arylcarbonylamino, 

carbamoyl and ureido), amidino, imino, sulfhydryl, alkylthio, arylthio, thiocarboxylate, 
sulfates, alkylsulfinyl, sulfonato, sulfamoyl, sulfonamido, nitro, trifluoromethyl, cyano, 
azido, heterocyclyl, alkylaryl, or an aromatic or heteroaromatic moiety. Examples of 
substituents also include nitro, alkoxy, aryl, amidyl, ester, thioester, alkyl (eg., methyl, 
35 ethyl, propyl, butyl, pentyl, etc.), araalkyl (<?.g., substituted or unsubstituted benzyl), 
hydroxy, halogen (e.g., fluorine, bromine, chlorine, iodine, etc.). 
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The term ''hydrophobic moiety" includes moieties such as which allow 
the transcription factor modulating compound (e.g., an HTH protein modulating 
compound, an AraC family polypeptide modulating compound, a MarA family 
polypeptide modulating compound, etc.) to perform its intended function, e.g., modulate 
5 a transcription factor. Examples of hydrophobic moieties include, for example, 

hydrogen, alkyl, alkenyl, alkynyl, and aryl moieties. The hydrophobic moieties may be 
unsubstituted or substituted, if chemically feasible {e.g., not hydrogen). In an 
embodiment, the hydrophobic moiety is substituted or unsubstituted phenyl. Examples 
of substituents include alkyl, alkenyl, alkynyl, alkoxy, halogen, amino, thiol, hydroxy, 

10 nitro, aryl, and heteroaryl. The substituents can be substituted or unsubstituted. la an 
embodiment, the phenyl hydrophobic moiety is /?ara-substituted, e.g., alkyl (methyl, 
ethyl, propyl, butyl, pentyl, etc.), halogen (e.g., fluorine, bromine, chlorine, iodine, etc.), 
hydroxy, substituted. 

In another embodiment, the hydrophobic moiety is heterocyclic. 

15 Examples of heterocyclic hydrophobic moieties include imidazopyridine, quinolinyl, 
pyridinyl, etc. 

In one embodiment, the transcription factor modulating compound (e.g., 
MarA family polypeptide modulating compound, AraC family polypeptide modulating 
compound, etc.) is of the formula (VE): 



w 



20 




(VD) 



wherein 

WisNH,OorS; 

X is O, S, or C, optionally linked to Q; 
AMsC-Z^CorS; 
25 A 2 isC-Z 2 ,0,orS; 

A 3 is C-Z 3 , 0,orS; 
A 4 isC-Z 4 , 0,orS; 
A 5 is C-Z 5 , orN-Z 5 ; 

Z , Z , Z , and Z are each independently selected from the group 
30 consisting of hydrogen, alkoxy, hydroxy, halogen, and alkyl; 

Z 5 is hydrogen, alkoxy, hydroxy, halogen, alkyl, or carbonyl; 
Q is hydrogen, alkyl, alkenyl, alkynyl, halogen, hydroxy, aryl, and 
pharmaceutically acceptable salts thereof. 
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In yet another embodiment, the transcription factor modulating compound 

(e.g., the MarA family polypeptide modulating compound, AraC family polypeptide 

modulating compound, etc.) is of the formula (D): 

z 1 w 




5 wherein 

WisOorS; 

X is O, S, or C, optionally linked to Q; 
A 1 is C-Z 4 , O, or S; 
A 2 isC-Z 5 ,orN-Z 5 ; 

10 Z l , Z 2 , Z 3 , Z 4 and Z 5 are each independently hydrogen, alkoxy, hydroxy, 

halogen, alkyl, alkenyl, alkynyl, aryl, heterocyclic, amino, or cyano; 

Z 3 is hydrogen, alkoxy, hydroxy, halogen, alkyl, alkenyl, alkynyl, aryl, 
heterocyclic, amino, nitro, cyano, carbonyl, or thiocarbonyl; 

Q is an aromatic or heterocyclic moiety, and pharmaceutically acceptable 

15 salts thereof. 

In a further embodiment, W may be oxygen and X may be oxygen. 
Furthermore, A 1 and A 2 may be C-Z 4 and C-Z 5 , respectively. Examples of Z 4 and 
Z 5 include hydrogen and hydroxy. Examples of Z l and Z 2 include hydrogen and hydroxy. 
Other examples of Z 2 also include halogen, e.g., fluorine, chlorine, bromine, and iodine. 

20 Examples of Z 3 include, for example, hydrogen, alkoxy and hydroxy. Examples of Q 
include substituted and unsubstituted phenyl. The phenyl may be para-substituted. 
Examples of substituents include hydroxyl, halogen {e.g., fluorine, bromine, chlorine, 
iodine, etc.), amino, alkyl (e.g., methyl, ethyl, propyl, butyl, pentyl, etc.), nitro, cyano, 
etc. In an embodiment, the transcription factor modulating compound is a MarA 

25 modulating compound, and in a further embodiment, a MarA inhibiting compound. 

In another embodiment, the transcription factor modulating compound 
(e.g., an AraC family polypeptide modulating compound, a MarA family polypeptide 
modulating compound, etc.) is of the formula (VHI): 




(vm) 



30 wherein: 
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G is a substituted or unsubstituted aromatic moiety, heterocyclic, alkyl, 
alkenyl, alkynyl, hydroxy, cyano, nitro, amino, carbonyl, or hydrogen; 

L l , L 2 , L 3 , L 4 , L 9 and L 10 are each independently oxygen, sulfur, 
substituted or unsubstituted nitrogen, and substituted or unsubstituted caibon; and 

L andL are each independently hydrogen, substituted or unsubstituted 
alkyl, alkenyl, alkynyl, acyl, heterocyclic, amino, nitro, hydroxy, cyano, alkoxy, or aiyl, 
and L 5 and L 6 may optionally be linked with a chain of one to six atoms to form a fused 
ring, and phannaceutically acceptable salts thereof. 



10 In another embodiment, the transcription factor modulating compound 

(e.g., an AraC family polypeptide modulating compound, a MarA family polypeptide 
modulating compound, etc.) is of the formula (IX): 




NR 9 



OX) 
wherein: 

15 G is substituted or unsubstituted aromatic moiety, heterocyclic, alkyl, 

alkenyl, alkynyl, hydroxy, cyano, nitro, amino, carbonyl, or hydrogen; 

1 2 3 4 • 

L , L , L , and L are each independently oxygen, sulfur, substituted or 
unsubstituted nitrogen, and substituted or unsubstituted carbon; and 

R 9 , L 5 and L 6 are each independently hydrogen, substituted or 
20 unsubstituted alkyl, alkenyl, alkynyl, acyl, heterocyclic, amino, nitro, hydroxy, cyano, 
alkoxy, or aryl, and L5 and I* may optionally be linked with a chain of one to six atoms 
to form a fused ring, and phannaceutically acceptable salts thereof. 

In another embodiment, the transcription factor modulating compound 
(e.g., an AraC family polypeptide modulating compound, a MarA family polypeptide 
25 modulating compound, etc.) is of the formula (ID): 

/ 




wherein 
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G is substituted or unsubstituted aromatic moiety, heterocyclic, alkyl, 
alkenyl, alkynyl, hydroxy, cyano, nitro, amino, carbonyl, or hydrogen; and 

L 1 , L 2 , L 3 , L 4 , L 5 , L 6 , L 7 , L 8 , L 9 , and L 10 are each independently oxygen, 
substituted or unsubstituted nitrogen, sulfur and or substituted or unsubstituted carbon, 
5 and pharmaceutical^ acceptable salts thereof. 

In a further embodiment, L 9 is N-R 9 , wherein R 9 is hydrogen, substituted 
or unsubstituted alkyl, alkenyl, alkynyl, acyl, or aryl. In another, L 10 is oxygen. In an 
embodiment, R 9 is hydrogen, hi another, G is substituted or unsubstituted phenyl or 
heteroaryl. In a further embodiment, G is cycloalkenyl, e.g., cyclohexenyl. In one 
10 embodiment, L l , L 2 , L 3 , and L 4 are each substituted or unsubstituted carbon and L 5 , L 6 , 
and L 8 are each nitrogen. L 7 may be substituted carbon, e.g., substituted with a thioether 
moiety. In another embodiment, L 9 and L 10 are each nitrogen. In another embodiment, 
the invention pertains to compounds of formula (EI), wherein L 9 is nitrogen, L 10 is 
oxygen, L^L 8 are each C-H, the dotted line represents a double bond and where G is not 
1 5 hydrogen or methyl. 

In another embodiment, the transcription factor modulating compound 
(e.g., an AraC family polypeptide modulating compound, a MarA family polypeptide 
modulating compound, etc.) is of the formula (X): 



Y. 1 




20 wherein 

Y 1 and Y 2 are each oxygen, sulfur, or substituted or unsubstituted carbon; 
J 1 , J 2 , J 3 , and J 4 are each oxygen, nitrogen, or optionally substituted 
carbon, and pharmaceutical^ acceptable salts thereof. 



25 In another embodiment, the transcription factor modulating compound 

(e.g., an AraC family polypeptide modulating compound, a MarA family polypeptide 
modulating compound, etc.) is of the formula (TV): 




wherein 

30 
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Y 1 and Y 2 are each oxygen or sulfur; 

J is hydrogen, substituted or unsubstituted alkyl, alkenyl, alkynyl, cyano, 
nitro, amino, or halogen; 

V is substituted or unsubstituted alkyl, alkenyl, alkynyl, alkoxy, 
5 alkylamino, or alkylthio; 

P and K are each independently substituted or unsubstituted aryl, and 
pharmaceutically acceptable salts thereof. 

In a further embodiment, Y 1 and Y 3 are each oxygen, V is alkoxy and J is 
lower alkyl. In another embodiment, P is substituted or unsubstituted phenyl. K may be 
1 0 substituted or unsubstituted heteroaryL 

In another embodiment, the transcription factor modulating compound 
(e.g., an AraC family polypeptide modulating compound, a MarA family polypeptide 
modulating compound, etc.) is of the formula (V): 

15 




(V) | 

wherein j 
T 1 , T 2 , T 3 , T 4 , T 5 , and T 6 are each independently substituted or 

unsubstituted carbon, oxygen, substituted or unsubstituted nitrogen, or sulfur; 
20 M is hydrogen, alkyl, alkenyl, alkynyl, heterocyclic or aryl, or i 

pharmaceutical^ acceptable salts thereof. ! 
In a further embodiment, T 5 is N-W or C-W, wherein W is alkyl, alkenyl, 

alkynyl, aryl, heterocyclic, acyl, hydroxy, alkoxy, alkthio, amino, nitro, halogen, or 

hydrogen. In another further embodiment, T 6 is N. 
25 hi a further embodiment, M is substituted or unsubstituted aryl. Wmay 

be substituted or unsubstituted alkyl. In another embodiment, T 1 , T 2 , T 3 and T 4 are each 

substituted or unsubstituted carbon. In a further embodiment, at least one of T 1 , T 2 , T 3 , 

and T 4 is nitrogen, and the remaining T moieties are substituted or unsubstituted carbon. 

In another embodiment, the transcription factor modulating compound j 
30 (e.g., an AraC family polypeptide modulating compound, a MarA family polypeptide j 

modulating compound, etc.) is of the formula (Va): 
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(Va) 



wherein 

R 1 is OH, OCOC0 2 H, or a substituted or unsubstituted straight or 
5 branched C1-C5 alkyloxy group; 

R 2 is H, C02(Ci-C s substituted or unsubstituted, straight or branched 
alkyl), or a substituted or unsubstituted aryl group; and 

R 4 , R 5 , R 6 , and R 7 are independently selected from the group consisting of 
H, (C1-C5 substituted or unsubstituted, straight or branched alkyl), 
10 C0 2 (Ci-C 5 substituted or unsubstituted, straight or branched alkyl), 
CO(Ci-C 5 substituted or unsubstituted, straight or branched alkyl), 
CO(substituted or unsubstituted aryl or heteroaryl), 
CO(C 3 -C 6 substituted or unsubstituted cycloalkyl), 
0(Ci-C 5 substituted or unsubstituted, straight or branched alkyl), 
15 C(NOH)(CrC 5 substituted or unsubstituted, straight or branched alkyl), 

substituted or unsubstituted amino, C0 2 H, CN, NO2, CONH 2 , (CO)(NHOH), and 
halogen 

hi certain embodiments of formula Va, those compounds disclosed in 
U.S. 10/139,591, filed May 6, 2002, are excluded from the scope of the present 
20 invention. 

mother embodiments of formula Va, when R isN0 2 andR is 
unsubstituted phenyl, then R' is not 0(CHCH 3 )(C02)CH 2 CH 3 or OCH2CO2H. Also, in 
another embodiment, when R 6 is H or NO2, then R 1 is not a phenyl-substituted alkyloxy 
group. In yet another embodiment, when R 4 , R 5 , R 6 , and R 7 are all H and R 2 ispara- 
25 methoxyphenyl, men R l is not OH. And in another embodiment, when R 4 , R 5 , R 6 , and 
R 7 are all H and R 2 is unsubstituted phenyl, then R l is not OCH2CO2CH2CH3 ; 

In certain aspects of formula Va, R 4 , R 5 , and R 7 are all H. 

Similarly, R 1 of formula Va may be selected from the group consisting of 
OH, 0(CR'R")i-3H, 0(CR'R")i- 3 OH, 0(CR'R")i- 3 C0 2 H, 0(CR'R")i- 3 C02(CR'R")i- 
30 3H, 0(CR'R")i- 3 (CO)NH 2 , 0(CR'R")i- 3 (CNH)NH 2 , OCOC0 2 H, 0(CR'R VsSOaH, 
0(CR'R")i- 3 OS0 3 H, 0(CR'R")i- 3 P03H, OCCR'R'OwOPOaH, 
0(CR'R")i- 3 N[(CR'R' ')<«H] 2 , 0(CR'R")i- 3 (CO)(NHOH), and 0(CR'R")i- 
3 (heteroaryl); 
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wherein R' and R" are each independently H, a C1-C3 alkyl, C2-C3 alkenyl, or C2-C3 

alkynyl group. Each R' and R" is preferably H or CH 3 . 

When R 1 of formula Va is 0(CR'R")i-3(heteroaryl), the heteroaryl group 

maybe a pyrrolyl, furanyl, thiophenyl, thiazolyl, isothiaozolyl, imidazolyl, triazolyl, 
5 tetrazolyl, pyrazolyl, oxazolyl, isooxazolyl, pyridinyl, pyrazinyl, pyridazinyl, or ; 

pyrimidinyl group. I 
Similarly, when R 2 of formula Va may be a substituted or unsubstituted 

phenyl, pyrrolyl, furanyl, thiophenyl, thiazolyl, isothiaozolyl, imidazolyl, triazolyl, 

tetrazolyl, pyrazolyl, oxazolyl, isooxazolyl, pyridinyl, pyrazinyl, pyridazinyl, or 
10 pyrimidinyl group. 

In a more particular embodiment, R 6 of formula Va is H, (CR'R")i- 3 H, 

(CR'R'OwOH, (CR'R")i- 3 NH 2 , (NOH)(CR'R")i- 3 H, CO(CR'R")<«NH 2 , 

CO(CR'R")i-3H, CO(CR'R")i-30H, CO(CR'R")o-3CF 3 , 

(CR'R")o-3N[(CR'R'V3H] 2 , CO(substituted or unsubstituted heteroaryl), 
15 CO(C 3 -C 6 substituted or unsubstituted cycloalkyl), 0(CR'R")i-3H, 

CO(substituted or unsubstituted phenyl), C0 2 (CR'R")o-3H, CN, N0 2 , F, CI, Br, or I, 

wherein R' and R" are each independently H, a C1-C3 alkyl, Qr-Cs alkenyl, or Cr-C 3 

alkynyl group. Preferably each R' and R" is independently H or CH 3 . 

hi yet another embodiment, R 6 of formula Va is CO(substituted or 
20 unsubstituted heteroaryl), wherein said heteroaryl group is a pyrrolyl, furanyl, 

thiophenyl, thiazolyl, isothiaozolyl, imidazolyl, triazolyl, tetrazolyl, pyrazolyl, oxazolyl, 

isooxazolyl, pyridinyl, pyrazinyl, pyridazinyl, or pyrimidinyl group. 

hi another embodiment, R 4 , R 5 , and R 7 are each hydrogen; R 6 is N0 2 , and 

R 1 is hydroxyl. In a further embodiment, R 2 is substituted aryl, e.g., substituted phenyl, 
25 substituted furanyl, or substituted benzoimidazole. In a further embodiment, when R 2 is 

substituted phenyl, R 2 is substituted with an optionally substituted arylcarbonylamino 

group, an amino group, a dialkyl amino group, or a carboxylate group. The aryl 

carbonylamino group may be substituted with dialkyl amino, alkyl, or halogens. In a 

further embodiment, when R 2 is a substituted furanyl group, R 2 is substituted with an 
30 aryl group, e.g., phenyl. In another embodiment, when R 2 is an optionally substituted 

benzoimidazole, it is substituted with an alkyl group. , 

i 

In another embodiment, the transcription factor modulating compound 
(e.g., an AraC family polypeptide modulating compound, a MarA family polypeptide j 
35 modulating compound, etc.) is of the formula (VI): ! 

i 

i 
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wherein 

G 1 , G 2 , and G 3 are each independently O, S, substituted or unsubstituted 
nitrogen, or substituted or- unsubstituted caibon; 
5 E 1 , E 2 , and E 3 are each independently hydrogen, alkyl, alkenyl, alkynyl, 

aryl, aralkyl, or acyl; and 

E 4 is alkyl, alkenyl, alkynyl, aiyl, halogen, cyano, amino, nitro, or acyl, 
and pharmaceutically acceptable salts thereof. 

In a further embodiment, G 1 , G 2 and G 3 are each oxygen. 

10 

Other transcription factor modulating compounds are shown in Table 3. 
The invention pertains to each of these compounds, methods (both therapeutic and 
otherwise) using each of the compounds, and compositions comprising at least one of 
the compounds of Table 4, Table 5, Table 6, Table 7, Table 8 or of formulae (I), (II), 
15 (III), (IV), (V), (VT), (VII), (Vm), (K) or (X). 

The invention also pertains to each of the following compounds: 

2-(4-isopropylphenyl)-4H-chromen-4-one; 

2-(3,4-Dihydroxy-phenyl)-3,5,7-trihydroxy-chromen-4-one 

N-isopropyl-2-[(4-me%l-5-qum^ 

20 yl)thio]acetamide; 

4-hydroxy-6-methyl-5,6-dihydro^ 

5,7-Dihydroxy-2-(4-hydroxy-phenyl)-chromen-4-one; 

2-[4-(dimethylaniino)phenyl]-4H-chromen-4-one; 

l-(benzyloxy)-2-phenyl-lH-imidazo[4,5-b]pyridine; 
25 2-(benzyltMo)^phenyl-5<l-phenyl4H4^,3,4-tetraazol-5^ 

yl)pyrimidine; 

6- fluoro-2-phenyl-4H-chromen-4-one; 

7- methoxy-2-phenyl-4H-chromen-4-one; 
4-(l,3-(hoxo-l,3^ydro-2H-inden-2-yliden)-2-phenyl-6-(2-- 

30 pyridinyl)tetrahydropyrrolo[3,4-c]pyrrole-l,3(2H,3^ 
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2-(2-Hydroxy-3-oxo-5-p-tol^^^ 

ethyl ester; 

24(6-mtro-2-phenyl-lH-l ,3-b 

2- (4-fluorophenyl)-4H-chromen-4-one; 
5 1 -methoxy-2-(4-methylphetoyl)-lH^ 

6-(5Jodo-furan-2-yl)-3-methylsulfanyl-6J-dihydro- 
tetraaza-dibenzo [a,c]cycloheptene; 

6-(4-Ethoxy-phenyl)-3-methylsdf^ 
tetraaza-dibenzo [a,c]cycloheptene; 
10 3-Methylsulfanyl-6-(5-nitro-^^ 
tetraaza-dibenzo [a,c]cycloheptene; 

3- Methylsulfanyl-6-[5-(4-nto^ 
1,2,4,7-tetraaza -dibenzo [a,c] cycloheptene; 

4- (3-Ethylsulfanyl-6 ,7-dihydro-5-oxa-l,2,4,7-tetraaza -dibenzo [a,c] 
15 cyclohepten-6-yl)-benzene-l,2«diol; 

6-(4-Benzyloxy-phenyl)-3-propylsulf^ 

tetraaza-dibenzo [a,c]cycloheptene; 

6-Benzo[13]dioxol-5-yl-3-methylsidfanyl-6,7-dihydro-5-oxa-l,2,4^ 

tetraaza-dibenzo [a,c]cycloheptene; 
20 3-Butylsulfanyl-6-(2>dimethoxy-pheny^^^^^ 
tetraaza -dibenzo[a,c] cycloheptene; 

6-(4-Allyloxy-phenyl)-3-butylsu^^ 
tetraaza-dibenzo[a,c]cycloheptene; 

3-Butylsulfanyl-6-(4-etho^ 
25 dibenzo [a,c] cycloheptene; 

6-(4-Methoxy-phenyl)-3-propylsxilfanyl-6 > 7-dihydro-5-oxa-l ,2,4,7- 

tetraaza-dibenzo [a,c]cycloheptene; 

6-[5-(3-Nitrcnphenyl)-furan-2-yl]-3-propylsulfanyl-6 
1,2,4,7-tetraaza -dibenzo[a,c] cycloheptene; 
30 2-(3-PhenyHH-pyrazol-4-ylmethylene)-benzo[4,5] imidazo[2,l- 

b]thiazol-3-one; 

2-[5-(3-Carboxy-phenyl)-furan-2-ylmethylene]-5-(2-methoxy-naphthalen- 
l-yl)-7-methyl-3-oxo-2,3-dihyd^ acid ethyl 

ester; 

35 5-(4-Dimethylamino-phenyl)-7^ 

furan-2-yl methylene]-3-oxo-2,3-dihydro -5H-thiazolo[3,2-a] pyrimidine-6-caiboxylic 
acid ethyl ester; 
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5-Benzo[l 3]dioxol-5-yl-7-methyl-2-[5-(2-methy^^^ 
2-yl methylene]-3-oxo-2,3-dihydro -5H-thiazolo[3,2-a] pyrimidine-6-carboxylic acid 
ethyl ester; 

5-(3,4-Dime1hoxy-phenyl)-7-methyl-2-[5-(2-methyl^nitro -phenyl)- 
5 furan-2-yl methylene]-3-oxo-2 9 3-dihydro-5H-thi^ 
acid ethyl ester; 

7-Methyl-2-[5-(2-methyl-4-i^ 
methyl sulfanyl-phenyl)-3-oxo-23-dihydro-5H-thiazolo[3,2-a] pyrimidine-6-carboxylic 
acid ethyl ester; 
10 2-[5-(4-Carboxy-phenyl)-fiiran-2-ylme 

l-yl)-7-methyl-3-oxo-2,3-dihydro^ acid ethyl 

ester; 

5-Benzo[l,3]dioxol-5-yl-2-[5-(4-ethoxycarbonyl-phenyl)-fliran-2- 
ylmethylene]-7-methyl-3-oxo-2,3-dihy^ 
15 acid ethyl ester; 

7-Methyl-3-oxo-5-phenyl-2-[5^ 
ylmethylene]-2,3-dihydro^ acid ethyl ester; 

7-Methyl-2-[5-(2-methyl-4-nito^ 
phenyl-2,3-dihydro-5H-ttaazolo[ acid ethyl ester; 

20 2-[5-(3-Carboxy-phenyl)-fiiran-2-ylme 
phenyl)-7-methyl-3-oxo-2,3-dihydro-5H^^ 
ethyl ester; 

5-(4-Dimethylamino-phenyl)-7-metty^ 
furan-2-yl methy!ene]-3-oxo-2,3^ 
25 acid ethyl ester; 

2- [5-(3-Carboxy-phenyl)-furan-2-ylmethylene]-7-methyl-5-(4- 
methylsulfanyl-phenyl)-3-oxo-2,3-d^ 

acid ethyl ester; 

[l,2]Naphthoquinone l-[0-(6-oxo-6H-anthra[ 1 ,9-cd] isoxazol-5-yl)- 

30 oxime]; 

3- Acetyl-2,5,7-triphenyl-lH-l,3aA8-tetraaza-7a-azoiiia- 
cyclopenta[a]indene; 

l-Amino-3-benzo[l,3]dioxol-5-yl-benzo[4,5]imidazo[l^a]pyri 

dicarbonitrile; 

35 2-[2-(5.Furan-2-yl-4-phenyl-4H.[l ,2,4]triazol-3-yl sulfanyl)- 

acetylamino]-benzoic acid methyl ester; 
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6.7- Dimethyl-2-(3-phenyl-lH-pyrazoM^^ 
benzo[4,5]imidazo[2,l-b] thiazol-3-one; 

2-(5-Benzo[l,2,5]oxadiazol-5-yl-4-methyl-4H«[l,2,4] triazol-3-ylsulfanyl>N-(3- 
methylsulfanyl-phenyl)-acetamide; 
5 4-(l ^-Dioxo-indan-2-yUdene)-2-phenyl-6-pyridin-2-yH^ 

pyrrolo[3,4-c] pyrrole-l,3-dione; 

6-Nitro-2-phenyM-(3-trifl^^^ 
(6-Nitro-2-phenyl-benzoimidazol-l-yloxy)-aceticacid; 

1- Benzyloxy-6-nitro-2-phenyl-lH-benzoimidazole; 
10 l-(4-Methyl-benzyloxy)-6-nito^^ 

6.8- Dimethyl-2-(4-nitro-pheny^ 
cyclopenta[a]naphthalene-4,7,9-trione; 

6,8-Dimethyl-5-phenyl-2-p-tolyl-5H^ 
cyclopenta [a]naphthalene-4,7,9-trione; 
15 2-[3-(4-Fluoro-phenyl)-l-phenyHH-pyrazol-4-yl methylenej-benzo [4,5] 

imidazo[2,l-b]thiazol-3-one; 

Cobalt 5 J0,15^0-Tetra-pyridin-4-yl-porphyrine; 

2- [3-(4-Fluoro-pte 
vinyl-imidazo[2,l-b]thiazol-3-one; 

20 Cobalt 5,10,15,20-Tetra-pyridin-3-yl-porphyrine; 

Zinc 5,10,15,20-Tetra-pyridin^-yl-porphyrine; 
2-(4-hydroxyphenyl)-4H-chromen-4-one, and phannaceutically 

acceptable salts thereof. 

In a further embodiment, the transcription factor modulating compound is 

25 notapigenin. In another 

The term "alkyl" includes saturated aliphatic groups, including straight-chain 
alkyl groups (e.g., methyl, ethyl, propyl, butyl, pentyl, hexyl, heptyl, octyl, nonyl, decyl, 
etc.), branched-chain alkyl groups (isopropyl, tert-butyl, isobutyl, etc.), cycloalkyl 
(alicyclic) groups (cyclopropyl, cyclopentyl, cyclohexyl, cycloheptyl, cyclooctyl), alkyl 

30 substituted cycloalkyl groups, and cycloalkyl substituted alkyl groups. The term alkyl 
further includes alkyl groups, which can further include oxygen, nitrogen, sulfur or 
phosphorous atoms replacing one or more carbons of the hydrocarbon backbone. In 
certain embodiments, a straight chain or branched chain alkyl has 6 or fewer carbon 
atoms in its backbone (e.g., Ci-C 6 for straight chain, C3-C6 for branched chain), and 

35 more preferably 4 or fewer. Likewise, preferred cycloalkyls have from 3-8 carbon atoms 
in their ring structure, and more preferably have 5 or 6 carbons in the ring structure. The 
term Ci-Ce includes alkyl groups containing 1 to 6 carbon atoms. 
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Moreover, the teim alkyl includes both "unsubstituted alkyls" and "substituted 
alkyls", the latter of which refers to alkyl moieties having substituents replacing a 
hydrogen on one or more carbons of the hydrocarbon backbone. Such substituents can 
include, for example, alkenyl, alkynyl, halogen, hydroxyl, alkylcarbonyloxy, 

5 arylcarbonyloxy, alkoxycarbonyloxy, aryloxycarbonyloxy, carboxylate, alkylcarbonyl, 
arylcarbonyl, alkoxycarbonyl, aminocarbonyl, alkylaminocarbonyl, 
dialkylaminocarbonyl, alkylthiocarbonyl, alkoxyl, phosphate, phosphonato, phosphinato, 
cyano, amino (including alkyl amino, dialkylamino, arylamino, diarylamino, and 
alkylarylamino), acylamino (including alkylcarbonylamino, arylcarbonylamino, 

10 carbamoyl and ureido), amidino, imino, sulfhydryl, alkylthio, arylthio, thiocarboxylate, 
sulfates, alkylsulfinyl, sulfonate, sulfamoyl, sulfonamido, nitro, trifluoromethyl, cyano, 
azido, heterocyclyl, alkylaryl, or an aromatic or heteroaromatic moiety. Cycloalkyls can 
be further substituted, e.g., with the substituents described above. An "alkylaryl" or an 
"arylalkyl" moiety is an alkyl substituted with an aryl (eg., phenylmethyl (benzyl)). The 

15 term "alkyl" also includes the side chains of natural and unnatural amino acids. 

The term "aryl" includes groups, including 5- and 6-membered single-ring 
aromatic groups that may include from zero to four heteroatoms, for example, benzene, 
phenyl, pyrrole, fiiran, thiophene, thiazole, isothiaozole, imidazole, triazole, tetrazole, 
pyrazole, oxazole, isooxazole, pyridine, pyrazine, pyridazine, and pyrimidine, and the 

20 like. Furthermore, the term "aryl" includes multicyclic aryl groups, e.g., tricyclic, 
bicyclic, e.g., naphthalene, benzoxazole, benzodioxazole, benzothiazole, 
benzoimidazole, benzothiophene, methylenedioxyphenyl, quinoline, isoquinoline, 
napthridine, indole, benzofuran, purine, benzofuran, deazapurine, or indolizine. Those 
aryl groups having heteroatoms in the ring structure may also be referred to as "aryl 

25 heterocycles", "heterocycles " "heteroaryls" or "heteroaromatics". The aromatic ring can 
be substituted at one or more ring positions with such substituents as described above, as 
for example, halogen, hydroxyl, alkoxy, alkylcarbonyloxy, arylcarbonyloxy, 
alkoxycarbonyloxy, aryloxycarbonyloxy, carboxylate, alkylcarbonyl, 
alkylaminoacarbonyl, aiylalkyl aminocarbonyl, alkenylaminocarbonyl, alkylcarbonyl, 

30 arylcarbonyl, arylalkylcarbonyl, alkenylcarbonyl, alkoxycarbonyl, aminocarbonyl, 

alkylthiocarbonyl, phosphate, phosphonato, phosphinato, cyano, amino (including alkyl 
amino, dialkylamino, arylamino, diarylamino, and alkylarylamino), acylamino (including 
alkylcarbonylamino, arylcarbonylamino, carbamoyl and ureido), amidino, imino, 
sulfhydryl, alkylthio, arylthio, thiocarboxylate, sulfates, alkylsulfinyl, sulfonate, 

35 sulfamoyl, sulfonamido, nitro, trifluoromethyl, cyano, azido, heterocyclyl, alkylaryl, or 
an aromatic or heteroaromatic moiety. Aryl groups can also be fused or bridged with 
alicyclic or heterocyclic rings which are not aromatic so as to form a polycycle (e.g., 
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tetralin). The term "aryl" also includes multicyclic aryl groups such as porphrins, 
phthalocyanines, etc. 

The term "alkenyl" includes unsaturated aliphatic groups analogous in length and 
possible substitution to the alkyls described above, but that contain at least one double 
5 bond. 

For example, the term "alkenyl" includes straight-chain alkenyl groups (e.g., 
ethylenyl, propenyl, butenyl, pentenyL, hexenyl, heptenyl, octenyl, nonenyl, decenyl, 
etc.), branched-chain alkenyl groups, cycloalkenyl (alicyclic) groups (cyclopropenyl, 
cyclopentenyl, cyclohexenyl, cycloheptenyl, cyclooctenyl), alkyl or alkenyl substituted 

10 cycloalkenyl groups, and cycloalkyl or cycloalkenyl substituted alkenyl groups. The 
term alkenyl further includes alkenyl groups which include oxygen, nitrogen, sulfur or 
phosphorous atoms replacing one or more carbons of the hydrocarbon backbone. In 
certain embodiments, a straight chain or branched chain alkenyl group has 6 or fewer 
carbon atoms in its backbone {e.g., C2-C6 for straight chain, C3-C6 for branched chain). 

15 Likewise, cycloalkenyl groups may have from 3-8 carbon atoms in their ring structure, 
and more preferably have 5 or 6 carbons in the ring structure. The term C2-C6 includes 
alkenyl groups containing 2 to 6 carbon atoms. 

Moreover, the term alkenyl includes both "unsubstituted alkenyls" and 
"substituted alkenyls", the latter of which refers to alkenyl moieties having substituents 

20 replacing a hydrogen on one or more carbons of the hydrocarbon backbone. Such 

substituents can include, for example, alkyl groups, alkynyl groups, halogens, hydroxyl, 
alkylcarbonyloxy, arylcarbonyloxy, alkoxycaibonyloxy, aryloxycarbonyloxy, 
carboxylate, alkylcarbonyl, arylcarbonyl, alkoxycarbonyl, aminocarbonyl, 
alkylaminocarbonyl, dialkylaminocarbonyl, alkylthiocarbonyl, alkoxyl, phosphate, 

25 phosphonato, phosphinato, cyano, amino (including alkyl amino, dialkylamino, 

arylamino, diarylamino, and alkylarylamino), acylamino (including alkylcarbonylamino, 
arylcarbonylamino, carbamoyl and ureido), amidino, imino, sulfhydryl, alkylthio, 
arylthio, thiocarboxylate, sulfates, alkylsulfinyl, sulfonato, sulfamoyl, sulfonamido, 
nitro, trifluoromethyl, cyano, azido, heterocyclyl, alkylaryl, or an aromatic or 

30 heteroaromatic moiety. 

The term "alkynyl" includes unsaturated aliphatic groups analogous in length and 
possible substitution to the alkyls described above, but which contain at least one triple 
bond. 

For example, the term "alkynyl" includes straight-chain alkynyl groups (e.g., 
35 ethynyl, propynyl, butynyl, pentynyl, hexynyl, heptynyl, octynyl, nonynyl, decynyl, etc.), 
branched-chain alkynyl groups, and cycloalkyl or cycloalkenyl substituted alkynyl 
groups. The term alkynyl further includes alkynyl groups which include oxygen, 
nitrogen, sulfur or phosphorous atoms replacing one or more carbons of the hydrocarbon 
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backbone. In certain embodiments, a straight chain or branched chain alkynyl group has 
6 or fewer carbon atoms in its backbone {e.g., C2-C6 for straight chain, C3-C6 for 
branched chain). The term C2-C6 includes alkynyl groups containing 2 to 6 carbon 
atoms. 

5 Moreover, the term alkynyl includes both "unsubstituted alkynyls" and 

"substituted alkynyls", the latter of which refers to alkynyl moieties having substituents 
replacing a hydrogen on one or more carbons of the hydrocarbon backbone. Such 
substituents can include, for example, alkyl groups, alkynyl groups, halogens, hydroxyl, 
alkylcarbonyloxy, arylcarbonyloxy, alkoxycarbonyloxy, aryloxycarbonyloxy, 

10 carboxylate, alkylcarbonyl, arylcarbonyl, alkoxycarbonyl, aminocarbonyl, 

alkylaminocarbonyl, dialkylaminocarbonyl, alkylthiocarbonyl, alkoxyl, phosphate, 
phosphonato, phosphinato, cyano, amino (including alkyl amino, dialkylamino, 
arylamino, diarylamino, and alkylarylamino), acylamino (including alkylcaxbonylamino, 
arylcarbonylamino, carbamoyl and ureido), amidino, imino, sulfhydryl, alkylthio, 

15 arylthio, thiocaiboxylate, sulfates, alkylsulfinyl, sulfonato, sulfamoyl, sulfonamido, 
nitro, trifluoromethyl, cyano, azido, heterocyclyl, alkylaryl, or an aromatic or 
heteroaromatic moiety. 

Unless the number of carbons is otherwise specified, "lower alkyl" as used herein 
means an alkyl group, as defined above, but having from one to five carbon atoms in its 

20 backbone structure. "Lower alkenyl" and "lower alkynyl" have chain lengths of, for 
example, 2-5 carbon atoms. 

The term "acyl" includes compounds and moieties which contain the acyl radical 
(CH3CO-) or a carbonyl group. The term "substituted acyl" includes acyl groups where 
one or more of the hydrogen atoms are replaced by for example, alkyl groups, alkynyl 

25 groups, halogens, hydroxyl, alkylcarbonyloxy, arylcarbonyloxy, alkoxycarbonyloxy, 
aryloxycarbonyloxy, carboxylate, alkylcarbonyl, arylcarbonyl, alkoxycarbonyl, 
aminocarbonyl, alkylaminocarbonyl, dialkylaminocarbonyl, alkylthiocarbonyl, alkoxyl, 
phosphate, phosphonato, phosphinato, cyano, amino (including alkyl amino, 
dialkylamino, arylamino, diarylamino, and alkylarylamino), acylamino (including 

30 alkylcarbonylamino, arylcarbonylamino, carbamoyl and ureido), amidino, imino, 
sulfhydryl, alkylthio, arylthio, thiocarboxylate, sulfates, alkylsulfinyl, sulfonato, 
sulfamoyl, sulfonamido, nitro, trifluoromethyl, cyano, azido, heterocyclyl, alkylaryl, or 
an aromatic or heteroaromatic moiety. 

The term "acylamino" includes moieties wherein an acyl moiety is bonded to an 

35 amino group. For example, the term includes alkylcarbonylamino, arylcarbonylamino, 
carbamoyl and ureido groups. 
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The term "aroyl" includes compounds and moieties with an aryl or 
heteroaromatic moiety bound to a carbonyl group. Examples of aroyl groups include 
phenylcarboxy, naphthyl carboxy, etc. 

The terms "alkoxyalkyl", "alkylaminoalkyl" and "thioalkoxyalkyl" include alkyl 
5 groups, as described above, which further include oxygen, nitrogen or sulfur atoms 
replacing one or more carbons of the hydrocarbon backbone, eg., oxygen, nitrogen or 
sulfur atoms. 

The term "alkoxy" includes substituted and unsubstituted alkyl, alkenyl, and 
alkynyl groups covalently linked to an oxygen atom. Examples of alkoxy groups include 

10 methoxy, ethoxy, isopropyloxy, propoxy, butoxy, and pentoxy groups. Examples of 
substituted alkoxy groups include halogenated alkoxy groups. The alkoxy groups can be 
substituted with groups such as alkenyl, alkynyl, halogen, hydroxyl, alkylcarbonyloxy, 
arylcarbonyloxy, alkoxycarbonyloxy, aryloxycarbonyloxy, carboxylate, alkylcarbonyl, 
arylcaibonyl, alkoxycarbonyl, aminocarbonyl, alkylaminocarbonyl, 

1 5 dialkylaminocarbonyl, alkylthiocarbonyl, alkoxyl, phosphate, phosphonato, phosphinato, 
cyano, amino (including alkyl amino, dialkylamino, arylamino, diarylamino, and 
alkylarylamino), acylamino (including alkylcarbonylamino, arylcarbonylamino, 
carbamoyl and ureido), amidino, imino, sulfhydryl, alkylthio, arylthio, thiocarboxylate, 
sulfates, alkylsulfinyl, sulfonato, sulfamoyl, sulfonamido, nitro, trifluoromethyl, cyano, 

20 azido, heterocyclyl, alkylaryl, or an aromatic or heteroaromatic moieties. Examples of 
halogen substituted alkoxy groups include, but are not limited to, fluoromethoxy, 
difluoiomethoxy, trifluoromethoxy, chloromethoxy, dichloromethoxy, trichloromethoxy, 
etc. 

The term "amine" or "amino" includes compounds where a nitrogen atom is 
25 covalently bonded to at least one carbon or heteroatom. The term "alkyl amino" 

includes groups and compounds wherein the nitrogen is bound to at least one additional 
alkyl group. The term "dialkyl amino" includes groups wherein the nitrogen atom is 
bound to at least two additional alkyl groups. The term "arylamino" and "diarylamino" 
include groups wherein the nitrogen is bound to at least one or two aryl groups, 
30 respectively. The term "alkylarylamino," "alkylaminoaryl" or "arylaminoalkyl" refers to 
an amino group which is bound to at least one alkyl group and at least one aryl group. 
The term "alkaminoalkyl" refers to an alkyl, alkenyl, or alkynyl group bound to a 
nitrogen atom which is also bound to an alkyl group. 

The term "amide" or "aminocarboxy" includes compounds or moieties which 
35 contain a nitrogen atom which is bound to the carbon of a carbonyl or a thiocarbonyl 
group. The term includes "alkaminocarboxy" groups which include alkyl, alkenyl, or 
alkynyl groups bound to an amino group bound to a carboxy group. It includes 
arylaminocarboxy groups which include aryl or heteroaryl moieties bound to an amino 
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group which is bound to the carbon of a carbonyl or thiocarbonyl group. The terms 
"alkylaminocarboxy," "alkenylaminocarboxy ," "alkynylaminocarboxy," and 
"arylaminocaiboxy" include moieties wherein alkyl, alkenyl, alkynyl and aryl moieties, 
respectively, are bound to a nitrogen atom which is in turn bound to the carbon of a 

5 carbonyl group. 

The term "carbonyl" or "carboxy" includes compounds and moieties which 
contain a carbon connected with a double bond to an oxygen atom. Examples of 
moieties which contain a carbonyl include aldehydes, ketones, carboxylic acids, amides, 
esters, anhydrides, etc. 
1 0 The term "thiocarbonyl" or "thiocarboxy" includes compounds and moieties 

which contain a carbon connected with a double bond to a sulfur atom. 

The term "ether" includes compounds or moieties which contain an oxygen 
bonded to two different carbon atoms or heteroatoms. For example, the term includes 
"alkoxyalkyl" which refers to an alkyl, alkenyl, or alkynyl group covalently bonded to an 
1 5 oxygen atom which is covalently bonded to another alkyl group. 

The teim "ester" includes compounds and moieties which contain a carbon or a 
heteroatom bound to an oxygen atom which is bonded to the carbon of a carbonyl group. 
The term "ester" includes alkoxycarboxy groups such as methoxycarbonyl, 
ethoxycarbonyl, propoxycarbonyl, butoxycarbonyl, pentoxycarbonyl, etc. The alkyl, 
20 alkenyl, or alkynyl groups are as defined above. 

The term "thioether" includes compounds and moieties which contain a sulfur 
atom bonded to two different carbon or hetero atoms. Examples of thioethers include, 
but are not limited to alkthioalkyls, alkthioalkenyls, and alkthioalkynyls. The term 
"alkthioalkyls" include compounds with an alkyl, alkenyl, or alkynyl group bonded to a 
25 sulfur atom which is bonded to an alkyl group. Similarly, the term "alkthioalkenyls" and 
alkthioalkynyls" refer to compounds or moieties wherein an alkyl, alkenyl, or alkynyl 
group is bonded to a sulfur atom which is covalently bonded to an alkynyl group. 
The term "hydroxy" or "hydroxyl" includes groups with an -OH or -0\ 
The term "halogen" includes fluorine, bromine, chlorine, iodine, etc. The term 
30 "perhalogenated" generally refers to a moiety wherein all hydrogens are replaced by 
halogen atoms. 

The terms "polycyclyl" or "polycyclic radical" refer to two or more cyclic rings 
(e.g., cycloalkyls, cycloalkenyls, cycloalkynyls, aryls and/or heterocyclyls) in which two 
or more carbons are common to two adjoining rings, e.g., the rings are "fused rings". 
35 Rings that are joined through non-adjacent atoms are termed bridged" rings. Each of 
the rings of the polycycle can be substituted with such substituents as described above, 
as for example, halogen, hydroxyl, alkylcarbonyloxy, arylcarbonyloxy, 
alkoxycarbonyloxy, aryloxycarbonyloxy, carboxylate, alkylcarbonyl, alkoxycarbonyl, 
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alkylaminoacarbonyl, arylalkylaininocarbonyl, alkenylaminocarbonyl, alkylcarbonyl, 
arylcarbonyl, arylalkyl carbonyl, alkenylcarbonyl, aminocaibonyl, alkylthiocaibonyl, 
alkoxyl, phosphate, phosphonato, phosphinaio, cyano, amino (including alkyl amino, 
dialkylamino, arylamino, diarylamino, and alkylarylamino), acylamino (including 

5 alkylcarbonylamino, arylcarbonylamino, carbamoyl and ureido), amidino, imino, 
sulfhydiyl, alkylthio, arylthio, thiocarboxylate, sulfates, alkylsulfinyl, sulfonate, 
sulfamoyl, sulfonamido, nitro, trifluoromethyl, cyano, azido, heterocyclyl, alkyl, 
alkylaryl, or an aromatic or heteroaromatic moiety. 

The term "heteroatom" includes atoms of any element other than carbon or 

1 0 hydrogen. Preferred heteroatoms are nitrogen, oxygen, sulfur and phosphorus. 

It will be noted that the structure of some of the compounds of this invention 
includes asymmetric carbon atoms. It is to be understood accordingly that the isomers 
arising from such asymmetry (e.g., all enantiomers and diastereomers) are included 
within the scope of this invention, unless indicated otherwise. Such isomers can be 

1 5 obtained in substantially pure form by classical separation techniques and by 

stereochemically controlled synthesis. Furthermore, the structures and other compounds 
and moieties discussed in this application also include all tautomers thereof. 

Bonds represented by " " in a structural formula mean that the 

bond may be either a single or a double bond. 

20 

IX. Formulations Comprising Transcription factor Modulating Compounds 

The invention provides compositions which include a therapeutically- 
effective amount or dose of a transcription factor modulating compound and/or a 
compound identified in any of the instant assays and one or more carriers (e.g., 

25 pharmaceutical^ acceptable additives and/or diluents). The pharmaceutical 

compositions of the invention may comprise any compound described in this application 
as a transcription factor modulating compound, an AraC family polypeptide modulating 
compound, a MarA family polypeptide modulating compound, a MarA family inhibiting 
compound, a MarA inhibiting compound, compounds of formulae (I), (II), (HI), (TV), 

30 (V), (VI), (VII), (Vm), (DC), (X), Table 4, Table 5, Table 6, Table 7, Table 8, scaffold, 
etc. Each of these compounds may be used alone of in combination as a part of a 
pharmaceutical composition of the invention. Furthermore, a composition can also 
include a second antimicrobial agent, e.g, an antibiotic. 

The invention pertains to pharmaceutical compositions comprising an 

35 effective amount of a transcription factor modulating compound (e.g., a MarA family 
polypeptide modulating compound or an AraC family polypeptide modulating 
compound), and a pharmaceutically acceptable carrier. In one embodiment, the 
transcription factor modulating compound is of the formula (EQ: 
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(D) 



wherein 



5 



Wis O or S; 

X is O, S, or C, optionally linked to Q; 

A^C-Z^O.arS; 

A 2 is C-Z 5 ,orN-Z 5 ; 

Z 1 , Z 2 , Z 3 , Z 4 and Z 5 are each independently hydrogen, alkoxy, hydroxy, 



halogen, alkyl, alkenyl, alkynyl, aryl, heterocyclic, amino, or cyano; 

Z 3 is hydrogen, alkoxy, hydroxy, halogen, alkyl, alkenyl, alkynyl, aryl, 
10 heterocyclic, amino, nitro, cyano, carbonyl, or thiocarbonyl; 

Q is an aromatic or heterocyclic moiety, and pharmaceutically acceptable 

salts thereof. 

In another embodiment, the pharmaceutical compositions of the invention 
15 include an effective amount of a transcription factor modulating compound of the 
formula (HI): 



G is substituted or unsubstituted aromatic moiety, heterocyclic, alkyl, 
20 alkenyl, alkynyl, hydroxy, cyano, nitro, amino, carbonyl, or hydrogen; and 

L\ L 2 , L 3 , L 4 , L 5 , L 6 , L 7 , L 8 , L 9 , and L 10 are each independently oxygen, 
substituted or unsubstituted nitrogen, sulfur and or substituted or unsubstituted carbon, 
and pharmaceutically acceptable salts thereof. 

25 In yet another embodiment, the pharmaceutical compositions of the 

invention include a pharmaceutically acceptable carrier (optional) and an effective 
amount of a transcription factor modulating compound of the formula (IV): 




(m) 



wherein 
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10 



25 




(IV) 

wherein 

Y 1 and Y 2 are each oxygen or sulfur; 

J is hydrogen, substituted or unsubstituted alkyl, alkenyl, alkynyl, cyano, 
nitro, amino, or halogen; 

V is substituted or unsubstituted alkyl, alkenyl, alkynyl, alkoxy, 

alkylamino, or alkylthio; 

P and K are each independently substituted or unsubstituted aiyl, and 

pharmaceutically acceptable salts thereof. 

In yet another embodiment, the pharmaceutical compositions of the 
invention include a pharmaceutically acceptable carrier (optional) and an effective 
amount of a transcription factor modulating compound of the formula (V): 

wherein 

T\ T 2 , T 3 , T 4 , T 5 , and T 6 are each independently substituted or 
unsubstituted carbon, oxygen, substituted or unsubstituted nitrogen, or sulfur; 

M is hydrogen, alkyl, alkenyl, alkynyl, or aryl, or pharmaceutically 
20 acceptable salts thereof. 

In yet another embodiment, the pharmaceutical compositions of the 
invention include a pharmaceutically acceptable carrier (optional) and an effective 
amount of a transcription factor modulating compound of the formula (Va): 























k 4 





(Va) 
wherein 

R 1 is OH, OCOC0 2 H, or a substituted or unsubstituted straight or 
branched C1-C5 alkyloxy group; 
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R is H, C0 2 (Ci-C 5 substituted or unsubstituted, straight or branched 
alkyl), or a substituted or unsubstituted aryl group; and 

R 4 , R 5 , R 6 , and R 7 are independently selected from the group consisting of 
H, (C1-C5 substituted or unsubstituted, straight or branched alkyl), 
5 CC>2(Ci-C5 substituted or unsubstituted, straight or branched alkyl), 
CO(Ci-C5 substituted or unsubstituted, straight or branched alkyl), 
CO(substituted or unsubstituted aryl or heteroaryl), 
CO(C3-C6 substituted or unsubstituted cycloalkyl), 
0(Ci-C5 substituted or unsubstituted, straight or branched alkyl), 
10 C(NOH)(Ci-C5 substituted or unsubstituted, straight or branched alkyl), 

substituted or unsubstituted amino, C0 2 H, CN, N0 2 , CONH 2 , (CO)(NHOH), and 
halogen. 

In certain embodiments of formula Va, those compounds disclosed in 
U.S. 10/139,591, filed May 6, 2002, are excluded from the scope of the present 
15 invention. 

hi other embodiments of formula Va, wljen R 6 is N0 2 and R 2 is 
unsubstituted phenyl, then R 1 is not 0(CHCH 3 )(C02)CH 2 CH3 or OCH 2 C0 2 H. Also, in 
another embodiment, when R 6 is H or NC^, then R 1 is not a phenyl-substituted alkyloxy 
group. In yet another embodiment, when R 4 , R 5 , R 6 , and R 7 are all H and R 2 is para- 
20 methoxyphenyl, then R 1 is not OH. And in another embodiment, when R 4 , R 5 , R 6 , and 
R 7 are all H and R 2 is unsubstituted phenyl, then R 1 is not OCH 2 C0 2 CH 2 CH 3 ; 

In certain aspects of formula Va, R 4 , R 5 , and R 7 are all H. 

Similarly, R 1 of formula Va maybe selected from the group consisting of 
OH, 0(CR'R")i- 3 H, 0(CR'R")i- 3 OH, O(CR'R'0i-3CO 2 H, 0(CR'R* V3C0 2 (CR'R")i- 
25 3 H, OCCR'R^wCCOJNHi, OCCR'R'OwCCNIONH^ OCOC0 2 H, 0(CR'R")i- 3 S0 3 H, 
0(CR'R")i- 3 OS03H, 0(CR'R")i-3P03H, 0(CR'R")i-30P0 3 H, 
0(CR'R ,5 )i- 3 N[(CR 5 R 9 V3H] 2 , O(CR^0i-3(CO)(NHOH), and 0(CR'R")i- 
3 (heteroaryl); 

wherein R' and R" are each independently H, a Ci-C 3 alkyl, C2-C3 alkenyl, or Cr-C 3 
30 alkynyl group. Each R' and R" is preferably H or CH 3 . 

When R l of formula Va is OCCR'R'Oi^Cheteroaryl), the heteroaryl group 
may be apyrrolyl, furanyl, thiophenyl, thiazolyl, isothiaozolyl, imidazolyl, triazolyl, 
tetrazolyl, pyrazolyl, oxazolyl, isooxazolyl, pyridinyl, pyrazinyl, pyridazinyl, or 
pyrimidinyl group. 

35 Similarly, when R 2 of formula Va may be a substituted or unsubstituted 

phenyl, pyrrolyl, furanyl, thiophenyl, thiazolyl, isothiaozolyl, imidazolyl, triazolyl, 
tetrazolyl, pyrazolyl, oxazolyl, isooxazolyl, pyridinyl, pyrazinyl, pyridazinyl, or 
pyrimidinyl group. 
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In a more particular embodiment, R 6 of formula Va is H, (CR'R'OwH, 
(CR'R")wOH, (CR'R'VaNH,, (NOH)(CR'R")i-3H, CO(CR'R' V3NH2, 
CO(CR'R")i- 3 H, CO(CR'R")i-30H, COtCR'R'VsCF^ 
(CR , R ,, )o-3N[(CR , R")o-3H]2, CO(substituted or unsubstituted heteroaryl), 

5 CO(C 3 -C 6 substituted or unsubstituted cycloalkyl), ©(CR'R^i-aH, 

CO(substituted or unsubstituted phenyl), C0 2 (CR'R")o-3H, CN, N0 2 , F, CI, Br, or I, 
wherein R' and R" are each independently H, a C1-C3 alkyl, C2-C3 alkenyL, or C2-C3 
alkynyl group. Preferably each R' and R" is independently H or CH 3 . 

In yet another embodiment, R 6 of formula Va is CO(substituted or 

10 unsubstituted heteroaryl), wherein said heteroaryl group is a pyrrolyl, furanyl, 

thiophenyl, thiazolyl, isothiaozolyl, imidazolyl, triazolyl, tetrazolyl, pyrazolyl, oxazolyl, 
isooxazolyl, pyridinyl, pyrazinyl, pyridazinyl, or pyrimidinyl group. 

In still another embodiment of formula Va, R 1 is OH, OCOCO2H, or a 
substituted straight or branched C1-C5 alkyloxy group, provided that R l is not a 2-amino- 

1 5 substituted ethoxy group or a substituted or unsubstituted benzyloxy group; R 2 is H, 
C0 2 (Ci-C 5 substituted or unsubstituted, straight or branched alkyl), or a substituted or 
unsubstituted aryl group, provided that said aryl group is not a thiazolyl or isothiazolyl 
group; and R 4 , R 5 , R 6 , and R 7 are independently selected from the group consisting of H, 
(C1-C5 substituted or unsubstituted, straight or branched alkyl), C0 2 (Ci-C 5 substituted or 

20 unsubstituted, straight or branched alkyl), CO(C r C 5 substituted or unsubstituted, 

straight or branched alkyl), CO(substituted or unsubstituted aryl or heteroaryl), CO(C 3 - 
C 6 substituted or unsubstituted cycloalkyl), 0(C r C 5 substituted or unsubstituted, straight 
or branched alkyl), C(NOH)(Ci-C 5 substituted or unsubstituted, straight or branched 
alkyl), substituted or unsubstituted amino, C0 2 H, CN, N0 2 , CONH 2 , (CO)(NHOH), and 

25 halogen; provided that when R 6 is N0 2 and R 2 is unsubstituted phenyl, then R l is not 
0(CHCH 3 )(C0 2 )CH 2 CH 3 or OCH 2 C0 2 H; provided that when R 6 is H or N0 2 , then R 1 is 
not a phenyl-substituted alkyloxy group; provided that when R 4 , R 5 , R 6 , and R 7 are all H 
and R 2 is para-methoxyphenyl, then R 1 is not OH; and provided that when R 4 , R 5 , R 6 , 
and R 7 are all H and R 2 is unsubstituted phenyl, or when R 4 , R 5 , and R 7 are all H, R 6 is 

30 CI, and R 2 is /rara-methyl-phenyl, then R 1 is not OCH 2 C0 2 CH 2 CH 3 . 

In yet another embodiment, the pharmaceutical compositions of the 
invention include a pharmaceutical^ acceptable carrier (optional) and an effective 
amount of a transcription factor modulating compound of the formula (VI): 
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(VI) 
wherein 

G 1 , G 2 , and G 3 are each independently O, S, substituted or unsubstituted 
nitrogen, or substituted or unsubstituted carbon; 
5 E l , E 2 , and E 3 are each independently hydrogen, alkyl, alkenyl, alkynyl, 

aryl, aralkyl, or acyl; and 

E 4 is alkyl, alkenyl, alkynyl, aryl, halogen, cyano, amino, nitro, or acyl, 
and phannaceutically acceptable salts thereof 

10 In yet another further embodiment, the pharmaceutical compositions of 

the invention comprise an effective amount of a transcription factor modulating 
compound listed below or found in Table 4, Table 5, Table 6, Table 7, Table 8: 
2-(4-isopropylphenyl)-4H-chromen-4-one; 
2-(3,4-Dihydroxy-phenyl)-3,5,7»trihydroxy-chromen-4-one 
15 N-isopropyl-2-[(4-me%l-5-quinolin-6-yl-4H4,2,4-triazol-3- 
yl)thio]acetamide; 

4-hydroxy-6-me%l-5,6-dihydro^ 
5,7-Dihydroxy-2-(4-hydroxy-phenyl)-chromen-4-one; 
2-[4-(dimethylamino)phenyl]-4H-chromen-4-one; 
20 1 -(benzyloxy)-2-phenyl- lH-imidazo[4,5-b]pyridine; 

2-(benzylthioH-phenyl-5<l-phenyl4H-l^,3,4-tetraazol-5- 

yl)pyrimidine; 

6- fluoro-2-phenyl-4H-chromen-4-one; 

7- methoxy-2-phenyl-4H-chromen-4-one; 

25 4-(l ,3-dioxo-l ,3-dihydro-2H-inden-2-yliden)-2-phenyl-6-(2- 

pyridinyl)tetrahydropyn-olo[3,4-c]pynx)le-l,3(2H,3aH) 

2-(2-Hydroxy-3-oxo-5-p4olyl-2,3-dihydro-fiu-an-2-yl)-malonamic acid 

ethyl ester; 

2-[(6-nitro-2-phenyl-lH4,3-benzimidazol-l-yl)oxy]acetic acid; 
30 2-(4-fluorophenyl)-4H-chromen-4-one; 

l-methoxy-2-(4-methylphenyl)-lH-imidazo[4,5-b]pyridine; 
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6-(5Jodo-furan-2-yl)-3-methylsulfanyl-6J-dihydro 
tetraaza-dibenzo [a,c]cycloheptene; 

6-(4-Ethoxy-phenyl)-3-methylsulf^^ 
tetraaza-dibenzo [a,c] cycloheptene; 
5 3-Methylsulfanyl-6-(5-m^ 
tetraaza-dibenzo [a,c]cycloheptene; 

3- Methylsulfanyl-6-[5-(4-nitro^ 
1,2,4,7-tetraaza -dibenzo [a,c] cycloheptene; 

4- (3-Ethylsulfanyl-6J-dihydro-5-oxa-l,2,4,7-tetraaza -dibenzo [a,c] 
10 cyclohepten-6-yl)-benzene-l,2-diol; 

6-(4-Benzyloxy-phenyl)-3-propylsulfanyl-6J-dihydro-5-oxa-l,2,4,7- 
tetraaza-dibetizo [a,c]cycloheptene; 

6-Benzo[l,3]dioxol-5-yl-3-methylsul^^ 
tetraaza-dibenzo [a,c]cycloheptene; 
1 5 3-Butylsulfanyl-6-(2,4-dimethoxy-phenyl)-6,7-dihydro-5-oxa- 1 ,2,4,7- 

tetraaza -dibenzo[a,c] cycloheptene; 

6-(4-Allyloxy-phenyl)-3-butylsulfanyl-6,7-dihy<ko-5-oxa- 1 ,2,4,7- 
tetraaza-dibenzo[a,c]cycloheptene; 

3-Butylsidfanyl-6-(4-ethoxy-phenyl)-6,7-dihydro-5-oxa-l,2,4,7-tetra 
20 dibenzo [a,c] cycloheptene; 

6-(4-Methoxy-phenyl)-3-propylsulfanyl-6,7-dihydro-5-oxa- 1 ,2,4,7- 
tetraaza-dibenzo [a,c]cycloheptene; 

6-[5-(3-Nitro-phenyl)-furan-2-yl]^ 
1,2,4,7-tetraaza -dibenzo[a,c] cycloheptene; 
25 2-(3-Phenyl-lH-pyrazol-4-ylmethylene)-benzo[4,5] imidazo[2,l- 

b]thiazol-3-one; 

2-[5-(3-Carboxy-phenyl)-fiiran-2-ylm^ 

1- yl)-7-me%l-3-oxo-2,3-d^ acid ethyl 
ester; 

30 5-(4-Dimethylamino-phenyl)-7-meto^ 

furan-2-yl methylene]-3-oxo-2,3-dihydro -5H-thiazolo[3,2-a] pVrimidine-6-carboxylic 
acid ethyl ester; 

5- Benzo[l ,3]dioxol-5-yl-7-methyl^ 

2- yl methylene]-3-oxo-2,3-dihydro -5H-thiazolo[3,2-a] pyrimidine-6-carboxylic acid 
35 ethyl ester, 

5-(3,4-Dimethoxy-phenyl)-7-metty^ 
fiiran-2-yl me%lene]-3-oxo-2,3-dihydro-5H-t^ 
acid ethyl ester; 
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7-Methyl-2-[5-(2-methyl^m 
methyl sulfanyl-phenyl)-3-oxo-23-dihydro-5H-thiazolo[3,2-a] pyrimidine-6-carboxylic 
acid ethyl ester; 

2-[5-(4-Carboxy-phenyl)-fiu^-2-ylm 
5 l-yl)-7-methyl-3-oxo-2,3-dihydro-5H-to acid ethyl 1 

ester; 

5-Benzo[l ,3]cttoxol-5-yl-2-[5-(4-eth^ 
ylmethylene]-7-methyl-3-oxo-2,3-dih^^ 
acid ethyl ester; 
10 7-Methyl-3-oxo-5-phenyl-2-[5^^ 

ylmethylene]-2,3-dihydro-5H-thiazo^ acid ethyl ester; 

7-Methyl-2-[5-(2-methyl-4-nito^ 
phenyl-2,3-dihy<ko-5H-thiazolo^ acid ethyl ester, 

2-[5 -(3 -Carboxy-phenyl)-furan-2-ylmethylene] -5 -(4-dimethylamino- 
15 phenyl)-7-me%l-3-oxo-2,3-dihydro-^^ 
ethyl ester; 

5- (4-Dimethylamino-phenyl)-7-m 
furan-2-yl me%lene]-3^xo-2,3K^ 

acid ethyl ester; 
20 2-[5-(3-Carboxy-phenyl)-fu^ 
methylsulfanyl-phenyl)-3-o^^ 
acid ethyl ester; 

[l,2]Naphthoquinone 1 - [0-(6-oxo-6H-anthra[ 1 ,9-cd] isoxazol-5-yl)- 

oximej; 

25 3-Acetyl-2,5,7-triphenyHH-l ,3M,8-tetraaza-7a-azonia- 

cyclopenta[a]indene; 

1 -Amino-3-benzo[ 1 ,3]dioxol-5-yl-benzo[4,5]imidazo[l ,2-a] pyridine-2,4- 

dicarbonitrile; 

2-[2-(5-Furan-2-yl-4-ph^ 
30 acetylaminoj-benzoic acid methyl ester; 

6 9 7-Dimethyl-2-(3-phenyl-lH-pyrazol-4-ylmethylene)- 
benzo[4,5]imidazo[2,l-b] thiazol-3-one; 

2-(5-Benzo[ U,5]oxadiazol«5-yl-4-methyMH-[l ,2,4] triazol-3-ylsulfanyl)-N<3- 
methylsulfanyl-phenyl)-acetamide; 
35 4-(l ,3-Dioxo-indan-2-yhdene)-2-pheny^ 

pyrrolo[3,4-c] pyrrole-l,3-dione; 

6- Nitro-2-phenyl-l-(34rifluorometh^ 
(6-Nitro-2-phenyl-benzoimidazoH -yloxy)-acetic acid; 
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l-Benzyloxy-6-rdtro-2-phenyl-lH-benzoiinidazole; 

1- (4-Methyl-benzyloxy)-6-rritro-2^ 
6,8-Dimethyl-2-(4-nifro-phenyl^^ 

cyclopenta[a]naphthalene-4,7,9-trione; 
5 6,8-Dimethyl-5-phenyl-2-p-toly^ 
cyclopenta [a]naphthalene-4,7,9-trione; 

2- [3-(4-Fluoro-phenyl)»l-phenyHH-pyrazol-4-yl methy!ene]-benzo [4,5] 
imidazo [2, 1 -bJthiazol-3-one; 

Cobalt 5 J0,15,20-Tetra-pyridin^yl-porphyrine; 
10 2-[3^4-Fluoro-phenyl)-l-phen^ 
vinyl-imidazo[2,l-b]thiazol-3-one; 

Cobalt 5,10,15,20-Tetra-pyridin-3-yl-porpbyrine; 
Zinc 5, 1 0, 1 5,20-Tetra-pyridin-4-yl-porphyrine; 
2-(4-hydroxyphenyl)-4H-chromen-4-one, and pharmaceutical^ 
1 5 acceptable salts thereof. 

In another embodiment, the method for preventing a bacterial associated 
state in a subject, comprising administering to the subject an effective amount of a 
transcription factor modulating compound, such that the bacterial associated state is 
prevented. 

20 The term "subject* ' includes plants and animals (e.g., vertebrates, 

amphibians, fish, mammals, e.g., cats, dogs, horses, pigs, cows, sheep, rodents, rabbits, 
squirrels, bears, primates (e.g., chimpanzees, gorillas, and humans) which are capable of 
suffering from a bacterial associated disorder. The term "subject' 9 also comprises 
immunocompromised subjects, who may be at a higher risk for infection. 

25 The term preventing" the administration of an effective amount of the 

transcription factor modulating compound to prevent a bacterial associated state from 
occurring. 

The term "bacterial associated state" includes states characterized by the 
presence of bacteria which can be prevented by administering the transcription factor 
30 modulating compounds of the invention. The term includes biofilm associated states and 
other infections or the undesirable presence of a bacteria on or in a subject. 

As described in detail below, the pharmaceutical compositions can be 
formulated for administration in solid or liquid form, including those adapted for the 
following: (1) oral administration, for example, aqueous or non-aqueous solutions or 
35 suspensions, tablets, boluses, powders, granules, pastes; (2) parental administration, for 
example, by subcutaneous, intramuscular or intravenous injection as, for example, a 
sterile solution or suspension; (3) topical application, for example, as a cream, ointment 
or spray applied to the skin; (4) intravaginally or intrarectally, for example, as a pessary, 
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cream, foam, or suppository; or (5) aerosol, for example, as an aqueous aerosol, 
liposomal preparation or solid particles containing the compound. 

The phrase "pharmaceutically-acceptable carrier" as used herein means a 
pharmaceutically-acceptable material, composition or vehicle, such as a liquid or solid 
5 filler, diluent, excipient, solvent or encapsulating material, involved in carrying or 
transporting the antiinfective agents or compounds of the invention from one organ, or 
portion of the body, to another organ, or portion of the body without affecting its 
biological effect. Each carrier should be "acceptable" in the sense of being compatible 
with the other ingredients of the composition and not injurious to the subject. Some 

10 examples of materials which can serve as pharmaceutically-acceptable carriers include: 
(1) sugars, such as lactose, glucose and sucrose; (2) starches, such as com starch and 
potato starch; (3) cellulose, and its derivatives, such as sodium carboxymethyl cellulose, 
ethyl cellulose and cellulose acetate; (4) powdered tragacanth; (5) malt; (6) gelatin; (7) 
talc; (8) excipients, such as cocoa butter and suppository waxes; (9) oils, such as peanut 

15 oil, cottonseed oil, safflower oil, sesame oil, olive oil, com oil and soybean oil; (10) 
glycols, such as propylene glycol; (1 1) polyols, such as glycerin, sorbitol, mannitol and 
polyethylene glycol; (12) esters, such as ethyl oleate and ethyl laurate; (13) agar, (14) 
buffering agents, such as magnesium hydroxide and aluminum hydroxide; (15) alginic 
acid; (16) pyrogen-free water; (17) isotonic saline; (18) Ringer's solution; (19) ethyl 

20 alcohol; (20) phosphate buffer solutions; and (21) other non-toxic compatible substances 
employed in pharmaceutical compositions. Proper fluidity can be maintained, for 
example, by the use of coating materials, such as lecithin, by the maintenance of the 
required particle size in the case of dispersions, and by the use of surfactants. 

These compositions may also contain adjuvants such as preservatives, 

25 wetting agents, emulsifying agents and dispersing agents. Prevention of the action of 
microbes may be ensured by the inclusion of various antibacterial and antifungal agents, 
for example, paraben, chlorobutanol, phenol sorbic acid, and the like. It may also be 
desirable to include isotonic agents, such as sugars, sodium chloride, and the like into the 
compositions. In addition, prolonged absorption of the injectable pharmaceutical form 

30 may be brought about by the inclusion of agents which delay absorption such as 
aluminum monostearate and gelatin. 

In some cases, in order to prolong the effect of a drug, it is desirable to 
slow the absorption of the drug from subcutaneous or intramuscular injection. This may 
be accomplished by the use of a liquid suspension of crystalline or amorphous material 

35 having poor water solubility. The rate of absorption of the drug then depends upon its 
rate of dissolution which, in turn, may depend upon crystal size and crystalline form. 
Alternatively, delayed absorption of a parenterally-administered drug form is 
accomplished by dissolving or suspending the drug in an oil vehicle. 
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Pharmaceutical compositions of the present invention maybe 
administered to epithelial surfaces of the body orally, parenterally, topically, rectally, 
nasally, intravaginally, intracisternally. They are of course given by forms suitable for 
each administration route. For example, they are administered in tablets or capsule form, 
5 by injection, inhalation, eye lotion, ointment, etc., administration by injection, infusion 
or inhalation; topical by lotion or ointment; and rectal or vaginal suppositories. 

The phrases "parenteral administration" and "administered parenterally" 
as used herein mean modes of administration other than enteral and topical 
administration, usually by injection, and includes, without limitation, intravenous, 
10 intramuscular, intraarterial, intrathecal, intracapsular, intraorbital, intracardiac, 

intradermal, intraperitoneal, transtracheal, subcutaneous, subcuticular, intraarticular, 
subcapsular, subarachnoid, intraspinal and intrasternal injection and infusion. 

The phrases "systemic administration," "administered systemically," 
"peripheral administration" and "administered peripherally" as used herein mean the 
15 administration of a sucrose octasulfate and/or an antibacterial, drug or other material 
other than directly into the central nervous system, such that it enters the subject's system 
and, thus, is subject to metabolism and other like processes, for example, subcutaneous 
administration. 

In some methods, the compositions of the invention can be topically 

20 administered to any epithelial surface. An "epithelial surface" according to this 

invention is defined as an area of tissue that covers external surfaces of a body, or which 
lines hollow structures including, but not limited to, cutaneous and mucosal surfaces. 
Such epithelial surfaces include oral, pharyngeal, esophageal, pulmonary, ocular, aural, 
nasal, buccal, lingual, vaginal, cervical, genitourinary, alimentary, and anorectal surfaces. 

25 Compositions can be formulated in a variety of conventional forms 

employed for topical administration. These include, for example, semi-solid and liquid 
dosage forms, such as liquid solutions or suspensions, suppositories, douches, enemas, 
gels, creams, emulsions, lotions, slurries, powders, sprays, lipsticks, foams, pastes, 
toothpastes, ointments, salves, balms, douches, drops, troches, chewing gums, lozenges, 

30 mouthwashes, rinses. 

Conventionally used carriers for topical applications include pectin, 
gelatin and derivatives thereof, polylactic acid or polyglycolic acid polymers or 
copolymers thereof, cellulose derivatives such as methyl cellulose, carboxymethyl 
cellulose, or oxidized cellulose, guar gum, acacia gum, karaya gum, tragacanth gum, 

35 bentonite, agar, carbomer, bladderwrack, ceratonia, dextran and derivatives thereof, 

ghatti gum, hectorite, ispaghula husk, polyvinypyrrolidone, silica and derivatives thereof, 
xanthan gum, kaolin, talc, starch and derivatives thereof, paraffin, water, vegetable and 
animal oils, polyethylene, polyethylene oxide, polyethylene glycol, polypropylene glycol, 
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glycerol, ethanol, propanol, propylene glycol (glycols, alcohols), fixed oils, sodium, 
potassium, aluminum, magnesium or calcium salts (such as chloride, carbonate, 
bicarbonate, citrate, gluconate, lactate, acetate, gluceptate or tartrate). 

Such compositions can be particularly useful, for example, for treatment 
5 or prevention of an unwanted cell, e.g., vaginal Neisseria gonorrhoeae, or infections of 
the oral cavity, including cold sores, infections of eye, the skin, or the lower intestinal 
tract. Standard composition strategies for topical agents can be applied to the 
antiinfective compounds or a phaimaceutically acceptable salt thereof in order to 
enhance the persistence and residence time of the drug, and to improve the prophylactic 

1 0 efficacy achieved. 

For topical application to be used in the lower intestinal tract or vaginally, 
a rectal suppository, a suitable enema, a gel, an ointment, a solution, a suspension or an 
insert can be used. Topical transdermal patches may also be used. Transdermal patches 
have the added advantage of providing controlled delivery of the compositions of the 

1 5 invention to the body. Such dosage forms can be made by dissolving or dispersing the 
agent in the proper medium. 

Compositions of the invention can be administered in the form of 
suppositories for rectal or vaginal administration. These can be prepared by mixing the 
agent with a suitable non-irritating carrier which is solid at room temperature but liquid 

20 at rectal temperature and therefore will melt in the rectum or vagina to release the drug. 
Such materials include cocoa butter, beeswax, polyethylene glycols, a suppository wax 
or a salicylate, and which is solid at room temperature, but liquid at body temperature 
and, therefore, will melt in the rectum or vaginal cavity and release the active agent. 

Compositions which are suitable for vaginal administration also include 

25 pessaries, tampons, creams, gels, pastes, foams, films, or spray compositions containing 
such carriers as are known in the art to be appropriate. The carrier employed in the 
sucrose octasulfate /contraceptive agent should be compatible with vaginal 
administration and/or coating of contraceptive devices. Combinations can be in solid, 
semi-solid and liquid dosage forms, such as diaphragm, jelly, douches, foams, films, 

30 ointments, creams, balms, gels, salves, pastes, slurries, vaginal suppositories, sexual 
lubricants, and coatings for devices, such as condoms, contraceptive sponges, cervical 
caps and diaphragms. 

For ophthalmic applications, the pharmaceutical compositions can be 
formulated as micronized suspensions in isotonic, pH adjusted sterile saline, or, 

35 preferably, as solutions in isotonic, pH adjusted sterile saline, either with or without a 
preservative such as benzylalkonium chloride. Alternatively, for ophthalmic uses, the 
compositions can be formulated in an ointment such as petrolatum. Exemplary 
ophthalmic compositions include eye ointments, powders, solutions and the like. 
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Powders and sprays can contain, in addition to sucrose octasulfate and/or 
antibiotic or contraceptive agent(s), carriers such as lactose, talc, aluminum hydroxide, 
calcium silicates and polyamide powder, or mixtures of these substances. Sprays can 
additionally contain customary propellants, such as chlorofluorohydrocarbons and 
5 volatile unsubstituted hydrocarbons, such as butane and propane. 

Ordinarily, an aqueous aerosol is made by formulating an aqueous 
solution or suspension of the agent together with conventional pharmaceutically 
acceptable carriers and stabilizers. The carriers and stabilizers vary with the 
requirements of the particular compound, but typically include nonionic surfactants 
10 (Tweens, Pluronics, or polyethylene glycol), proteins like serum albumin, sorbitan esters, 
oleic acid, lecithin, amino acids such as glycine, buffers, salts, sugars or sugar alcohols. 
Aerosols generally are prepared from isotonic solutions. 

Compositions of the invention can also be orally administered in any 
orally-acceptable dosage form including, but not limited to, capsules, cachets, pills, 
15 tablets, lozenges (using a flavored basis, usually sucrose and acacia or tragacanth), 

powders, granules, or as a solution or a suspension in an aqueous or non-aqueous liquid, 
or as an oil-in-water or water-in-oil liquid emulsion, or as an elixir or syrup, or as 
pastilles (using an inert base, such as gelatin and glycerin, or sucrose and acacia) and/or 
as mouth washes and the like, each containing a predetermined amount of sucrose 
20 octasulfate and/or antibiotic or contraceptive agent(s) as an active ingredient. A 

compound may also be administered as a bolus, electuary or paste. In the case of tablets 
for oral use, carriers which are commonly used include lactose and corn starch. 
Lubricating agents, such as magnesium stearate, are also typically added. For oral 
administration in a capsule form, useful diluents include lactose and dried com starch. 
25 When aqueous suspensions are required for oral use, the active ingredient is combined 
with emulsifying and suspending agents. If desired, certain sweetening, flavoring or 
coloring agents may also be added. 

Tablets, and other solid dosage forms, such as dragees, capsules, pills and 
granules, may be scored or prepared with coatings and shells, such as enteric coatings 
30 and other coatings well known in the pharmaceutical-formulating art. They may also be 
formulated so as to provide slow or controlled release of the active ingredient therein 
using, for example, hydroxypropylmethyl cellulose in varying proportions to provide the 
desired release profile, other polymer matrices, liposomes and/or microspheres. They 
may be sterilized by, for example, filtration through a bacteria-retaining filter, or by 
35 incorporating sterilizing agents in the form of sterile solid compositions which can be 
dissolved in sterile water, or some other sterile injectable medium immediately before 
use. These compositions may also optionally contain opacifying agents and may be of a 
composition that they release the active ingredient(s) only, or preferentially, in a certain 
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portion of the gastrointestinal tract, optionally, in a delayed manner. Examples of 
embedding compositions which can be used include polymeric substances and waxes. 
The active ingredient can also be in micro-encapsulated form, if appropriate, with one or 
more of the above-described excipients. 

5 Liquid dosage forms for oral administration include pharmaceutical^ 

acceptable emulsions, microemulsions, solutions, suspensions, syrups and elixirs. In 
addition to the active ingredient, the liquid dosage forms may contain inert diluents 
commonly used in the art, such as, for example, water or other solvents, solubilizing 
agents and emulsifiers, such as ethyl alcohol, isopropyl alcohol, ethyl carbonate, ethyl 

10 acetate, benzyl alcohol, benzyl benzoate, propylene glycol, 1,3-butylene glycol, oils (in 
particular, cottonseed, groundnut, corn, germ, olive, castor and sesame oils), glycerol, 
tetrahydrofuryl alcohol, polyethylene glycols and fatty acid esters of sorbitan, and 
mixtures thereof. 

Besides inert diluents, the oral compositions can also include adjuvants 

15 such as wetting agents, emulsifying and suspending agents, sweetening, flavoring, 
coloring, perfuming and preservative agents. 

Suspensions, in addition to the antiinfective agent(s) may contain 
suspending agents as, for example, ethoxylated isostearyl alcohols, polyoxyethylene 
sorbitol and sorbitan esters, microcrystalline cellulose, aluminum metahydroxide, 

20 bentonite, agar-agar and tragacanth, and mixtures thereof. 

Sterile injectable forms of the compositions of this invention can be 
aqueous or oleaginous suspension. These suspensions maybe formulated according to 
techniques known in the art using suitable dispersing or wetting agents and suspending 
agents. Wetting agents, emulsifiers and lubricants, such as sodium lauryl sulfate and 

25 magnesium stearate, as well as coloring agents, release agents, coating agents, 

sweetening, flavoring and perfuming agents, preservatives and antioxidants can also be 
present in the compositions. 

The sterile injectable preparation may also be a sterile injectable solution 
or suspension in a nontoxic parenterally-acceptable diluent or solvent, for example as a 

30 solution in 1,3-butanediol. Among the acceptable vehicles and solvents that may be 
employed are water, Ringer's solution and isotonic sodium chloride solution, hi 
addition, sterile, fixed oils are conventionally employed as a solvent or suspending 
medium. For this purpose, any bland fixed oil may be employed including synthetic 
mono-or di-glycerides. Fatty acids, such as oleic acid and its glyceride derivatives are 

35 useful in the preparation of injectables, as are natural pharmaceutically-acceptable oils, 
such as olive oil or castor oil, especially in their polyoxyethylated versions. These oil 
solutions or suspensions may also contain a long-chain alcohol diluent or dispersant, 
such as Ph. Helv or similar alcohol. 
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The antiinfective agent or a pharmaceutical acceptable salt thereof will 
represent some percentage of the total dose in other dosage forms in a material fo rming a 
combination product, including liquid solutions or suspensions, suppositories, douches, 
enemas, gels, creams, emulsions, lotions slurries, soaps, shampoos, detergents, powders, 
5 sprays, lipsticks, foams, pastes, toothpastes, ointments, salves, balms, douches, drops, 
troches, lozenges, mouthwashes, rinses and others. Creams and gels for example, are 
typically limited by the physical chemical properties of the delivery medium to 
concentrations less than 20% (e.g, 200 mg/gm). For special uses, far less concentrated 
preparations can be prepared, (e.g., lower percent formulations for pediatric 

10 applications). For example, the pharmaceutical composition of the invention can 
comprise sucrose octasulfate in an amount of 0.001-99%, typically 0.01-75%, more 
typically 0.1-20%, especially 1-10% by weight of the total preparation. In particular, a 
preferred concentration thereof in the preparation is 0.5-50%, especially 0.5-25%, such 
as 1-10%. It can be suitably applied 1-10 times a day, depending on the type and 

1 5 severity of the condition to be treated or prevented. 

Given the low toxicity of an antiinfective agent or a pharmaceutically 
acceptable salt thereof over many decades of clinical use as an anti-ulcerant [W.R. 
Garnett, Clin. Pharm. 1:307-314 (1982); R.N. Brogden et al., Drugs 27:194-209 (1984); 
D.M. McCarthy, NewEngJMed, 325:1017-1025 (1991), an upper limit for the 

20 therapeutically effective dose is not a critical issue. 

For prophylactic applications, the pharmaceutical composition of the 
invention can be applied prior to potential infection. The timing of application prior to 
potential infection can be optimized to maximize the prophylactic effectiveness of the 
compound. The timing of application will vary depending on the mode of 

25 administration, the epithelial surface to which it is applied, the surface area, doses, the 
stability and effectiveness of composition under the pH of the epithelial surface, the 
frequency of application, e.g., single application or multiple applications. One skilled in 
the art will be able to determine the most appropriate time interval required to maximize 
prophylactic effectiveness of the compound. 

30 The practice of the present invention will employ, unless otherwise 

indicated, conventional techniques of cell biology, cell culture, molecular biology, 
genetics, microbiology, recombinant DNA, and immunology, which are within the skill 
of the art. Such techniques are explained fully in the literature. See, for example, 
Genetics; Molecular Cloning A Laboratory Manual, 2nd Ed., ed. by Sambrook, J. et al 

35 (Cold Spring Harbor Laboratory Press (1 989)); Short Protocols in Molecular Biology, 
3rd Ed., ed. by Ausubel, F. et al (Wiley, NY (1995)); DNA Cloning, Volumes I and II 
(D. N. Glover ed., 1985); Oligonucleotide Synthesis (M. J. Gait ed. (1984)); Mullis et al 
U.S. Patent No: 4,683,195; Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins 
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eds. (1984)); the treatise, Methods In Enzymology (Academic Press, Inc., N.Y); 
Immunochemical Methods In Cell And Molecular Biology (Mayer and Walker, eds., 
Academic Press, London (1987)); Handbook Of Experimental Immunology, Volumes I- 
IV (D. M. Weir and C. C. Blackwell, eds. (1986)); and Miller, J. Experiments in 
5 Molecular Genetics (Cold Spring Haibor Press, Cold Spring Harbor, N.Y. (1 972)). 

X. The Role of Transcription Activation Factor Polypeptides in Biofilms 

In one embodiment, the invention pertains to a method for dispersing or 

preventing the formation of a biofilm on a surface or in an area, by administering an 
10 effective amount of a transcription factor modulating compound, e.g., a HTH protein 

modulating compound, an AraC family polypeptide modulating compound, a MarA 

family polypeptide modulating compound, or a MarA inhibiting compound. 

It has been discovered that the absence of MarA and its homologs has a 

negative effect on biofilm formation in E. coli. In order to confirm this finding 
15 genetically, plasmid encoded mar A was transformed into an E. coli strain deleted of 

mar A, soxS, and rob (triple knockout). The expression of MarA in this triple knockout 

restored biofilm formation in this host to a level that was comparable to that of the wild 

type host. 

The term "biofilm" includes biological films that develop and persist at interfaces 

20 in aqueous and other environments. Biofilms are composed of microorganisms 

embedded in an organic gelatinous structure composed of one or more matrix polymers 
which are secreted by the resident microorganisms. The term "biofilm" also includes 
bacteria that are attached to a surface in sufficient numbers to be detected or 
communities of microorganisms attached to a surface (Costerton, J. W., etal (1987) 

25 Ann. Rev. Microbiol 41 :435-464; Shapiro, J. A. (1988) SciAm. 256:82-89; OToole, G. 
et al. (2000) Annu Rev Microbiol 54:49-79). 

In another embodiment, the invention pertains to methods of treating biofilm 
associated states in a subject, by administering to said subject an effective amount of a 
transcription factor modulating compound, e.g., a MarA family inhibiting compound, 

30 such that the biofilm associated state is treated. 

The term ''biofilm associated states" includes disorders which are characterized 
by the presence or potential presence of a bacterial biofilm. Many medically important 
pathogens form biofilms and biofilm formation is often one component of the infectious 
process (Costerton, J. W. et al (1999) Science 284:1318-1322). Examples of biofilm 

35 associated states include, but are not limited to, middle ear infections, cystic fibrosis, 
osteomyelitis, acne, dental cavities, and prostatitis. Biofilm associated states also 
include infection of the subject by one or more bacteria, e.g., Pseudomonas aeruginosa. 
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One consequence of biofilm formation is that bacteria within biofilms are generally less 
susceptible to a range of different antibiotics relative to their planktonic counterparts. 

Furtheimore, the invention also pertains to methods for preventing the 
formation of biofilms on surfaces or in areas, by contacting the area with an effective 
5 amount of a transcription factor modulating compound, e.g., a MarA family inhibiting 
compound, etc. 

Industrial facilities employ many methods of preventing biofouling of 
industrial water systems. Many microbial organisms are involved in biofilm formation 
in industrial waters. Growth of slime-producing bacteria in industrial water systems 

10 causes problems including decreased heat transfer, fouling and blockage of lines and 
valves, and corrosion or degradation of surfaces. Control of bacterial growth in the past 
has been accomplished with biocides. Many biocides and biocide formulations are 
known in the art. However, many of these contain components which may be 
environmentally deleterious or toxic, and are often resistant to breakdown. 

1 5 The transcription factor inhibiting compounds, such as but not limited to 

AraC family inhibiting compounds and MarA family inhibiting compounds, of the 
present invention are useful in a variety of environments including industrial, clinical, 
the household, and personal care. The compositions of the invention may comprise one 
or more compounds of the invention as an active ingredient acting alone, additively, or 

20 synergistically against the target organism. 

The MarA family inhibiting compounds and modulating compounds of 
the invention may be formulated in a composition suitable for use in environments 
including industry, pharmaceutics, household, and personal care. In an embodiment, the 
compounds of the invention are soluble in water. The modulating compounds may be 

25 applied or delivered with an acceptable carrier system. The composition may be applied 
or delivered with a suitable carrier system such that the active ingredient (e.g., 
transcription factor modulating compound of the invention such as a MarA family 
modulating compound, e.g., a MarA family polypeptide inhibiting compound) maybe 
dispersed or dissolved in a stable manner so that the active ingredient, when it is 

30 administered directly or indirectly, is present in a form in which it is available in a 
advantageous way. 

Also, the separate components of the compositions of the invention may 
be preblended or each component may be added separately to the same environment 
according to a predetermined dosage for the purpose of achieving the desired 

35 concentration level of the treatment components and so long as the components 

eventually come into intimate admixture with each other. Further, the present invention 
may be administered or delivered on a continuous or intermittent basis. 
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A transcription factor modulating compound, e.g., a MarA family 
modulating compound of the present invention, when present in a composition will 
generally be present in an amount from about 0.000001% to about 100%, more 
preferably from about 0.001% to about 50%, and most preferably from about 0.01% to 
5 about 25%. 

For compositions of the present invention comprising a carrier, the 
composition comprises, for example, from about 1% to about 99%, preferably from 
about 50% to about 99%, and most preferably from about 75% to about 99% by weight 
of at least one carrier. 

10 The transcription factor modulating compound, e.g., the MarA family 

polypeptide inhibiting compound, of the invention may be formulated with any suitable 
carrier and prepared for delivery in forms, such as, solutions, microemulsions, 
suspensions or aerosols. Generation of the aerosol or any other means of delivery of the 
present invention may be accomplished by any of the methods known in the art. For 

15 example, in the case of aerosol delivery, the compound is supplied in a finely divided 
form along with any suitable carrier with a propellant. Liquefied propellants are 
typically gases at ambient conditions and are condensed under pressure. The propellant 
may be any acceptable and known in the art including propane and butane, or other 
lower alkanes, such as those of up to 5 carbons. The composition is held within a 

20 container with an appropriate propellant and valve, and maintained at elevated pressure 
until released by action of the valve. 

The compositions of the invention may be prepared in a conventional 
form suitable for, but not limited to topical or local application such as an ointment, 
paste, gel, spray and liquid, by including stabilizers, penetrants and the carrier or diluent 

25 with the compound according to a known technique in the art. These preparations may 
be prepared in a conventional form suitable for enteral, parenteral, topical or inhalational 
applications. 

The present invention may be used in compositions suitable for 
household use. For example, compounds of the present invention are also useful as 

30 active antimicrobial ingredients in household products such as cleansers, detergents, 
disinfectants, dishwashing liquids, soaps and detergents. In an embodiment, the 
transcription factor modulating compound of the present invention maybe delivered in 
an amount and form effective for the prevention, removal or termination of microbes. 

The compositions of the invention for household use comprise, for 

35 example, at least one transcription factor modulating compound of the invention and at 
least one suitable carrier. ,For example, the composition may comprise from about 
0.00001% to about 50%, preferably from about 0.0001% to about 25%, most preferably 
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from about 0.0005% to about 10% by weight of the modulating compound based on the 
weight percentage of the total composition. 

The transcription factor modulating compound of the present invention 
may also be used in hygiene compositions for personal care. For instance, compounds of 
5 the invention can be used as an active ingredient in personal care products such as facial 
cleansers, astringents, body wash, shampoos, conditioners, cosmetics and other hygiene 
products. The hygiene composition may comprise any carrier or vehicle known in the art 
to obtain the desired form (such as solid, liquid, semisolid or aerosol) as long as the 
effects of the compound of the present invention are not impaired. Methods of 

10 preparation of hygiene compositions are not described herein in detail, but are known in 
the art. For its discussion of such methods, The CTFA Cosmetic Ingredient Handbook, 
Second Edition, 1992, and pages 5-484 of A Formulary of Cosmetic Preparations (Vol. 
2, Chapters 7-16) are incorporated herein by reference. 

The hygiene composition for use in personal care comprise generally at 

1 5 least one modulating compound of the present application and at least one suitable 
carrier. For example, the composition may comprise from about 0.00001% to about 
50%, preferably from about 0.0001% to about 25%, more preferably from about 
0.0005% to about 10% by weight of the transcription factor modulating compound of the 
invention based on the weight percentage of the total composition. 

20 The transcription factor modulating compound of the present invention 

may be used in industry. In the industrial setting, the presence of microbes can be 
problematic, as microbes are often responsible for industrial contamination and 
biofouling. Compositions of the invention for industrial applications may comprise an 
effective amount of the compound of the present invention in a composition for 

25 industrial use with at least one acceptable carrier or vehicle known in the art to be useful 
in the treatment of such systems. Such carriers or vehicles may include diluents, 
deflocculating agents, penetrants, spreading agents, surfactants, suspending agents, 
wetting agents, stabilizing agents, compatibility agents, sticking agents, waxes, oils, co- 
solvents, coupling agents, foams, antifoaming agents, natural or synthetic polymers, 

30 elastomers and synergists. Methods of preparation, delivery systems and carriers for 
such compositions are not described here in detail, but are known in the art. For its 
discussion of such methods, U.S. Patent No. 5,939,086 is herein incorporated by 
reference. Furthermore, the preferred amount of the composition to be used may vary 
according to the active ingredient(s) and situation in which the composition is being 

35 applied. 

The transcription factor modulating compounds, e.g., MarA family 
polypeptide inhibiting compounds, and compositions of the present invention may be 
useful in nonaqueous environments. Such nonaqueous environments may include, but 
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are not limited to, terrestrial environments, dry surfaces or semi-dry surfaces in which 
the compound or composition is applied in a manner and amount suitable for the 
situation. 

The transcription factor modulating compounds, e.g., MarA family 
5 polypeptide modulating compounds, e.g. , MarA inhibiting compounds, of the present 
invention may be used to form contact-killing coatings or layers on a variety of 
substrates including personal care products (such as toothbrushes, contact lens cases and 
dental equipment), healthcare products, household products, food preparation surfaces i 
and packaging, and laboratory and scientific equipment. Further, other substrates 

10 include medical devices such as catheters, urological devices, blood collection and 
transfer devices, tracheotomy devices, intraocular lenses, wound dressings, sutures, 
surgical staples, membranes, shunts, gloves, tissue patches, prosthetic devices (e.g., heart 
valves) and wound drainage tubes. Still further, other substrates include textile products 
such as carpets and fabrics, paints and joint cement. A further use is as an antimicrobial 

15 soil fumigant. 

The transcription factor modulating compounds of the invention may also 
be incorporated into polymers, such as polysaccharides (cellulose, cellulose derivatives, 
starch, pectins, alginate, chitin, guar, carrageenan), glycol polymers, polyesters, 
polyurethanes, polyacrylates, polyacrylonitrile, polyamides (e.g., nylons), polyolefins, 

20 polystyrenes, vinyl polymers, polypropylene, silks or biopolymers. The modulating 
compounds may be conjugated to any polymeric material such as those with the 
following specified functionality: 1) carboxy acid, 2) amino group, 3) hydroxyl group 
and/or 4) haloalkyl group. 

The composition for treatment of nonaqueous environments may be 

25 comprise at least one transcription factor modulating compound of the present ( 
application and at least one suitable carrier. In an embodiment, the composition 
comprises from about 0.001% to about 75%, advantageously from about 0.01% to about 
50%, and preferably from about 0.1% to about 25% by weight of a transcription factor 

modulating compound of the invention based on the weight percentage of the total | 

i 

30 composition. | 
The transcription factor modulating compounds and compositions of the j 
invention may also be useful in aqueous environments. "Aqueous environments" j 
include any type of system containing water, including, but not limited to, natural bodies 1 
of water such as lakes or ponds; artificial, recreational bodies of water such as swimming 

35 pools and hot tubs; and drinking reservoirs such as wells. The compositions of the 
present invention may be useful in treating microbial growth in these aqueous 
environments and may be applied, for example, at or near the surface of water. 
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The compositions of the invention for treatment of aqueous environments 
may comprise at least one transcription factor modulating compound of the present 
invention and at least one suitable carrier. In an embodiment, the composition comprises 
from about 0.001% to about 50%, advantageously from about 0.003% to about 15%, 
5 preferably from about 0.01% to about 5% by weight of the compound of the invention 
based on the weight percentage of the total composition. 

The present invention also provides a process for the production of an 
antibiofouling composition for industrial use. Such process comprises bringing at least 
one of any industrially acceptable carrier known in the art into intimate admixture with a 

10 transcription factor modulating compound of the present invention. The carrier may be 
any suitable carrier discussed aboye or known in the art. 

The suitable antibiofouling compositions may be in any acceptable form 
for delivery of the composition to a site potentially having, or having at le^st one living 
microbe. The antibiofouling compositions may be delivered with at least one suitably 

15 selected carrier as hereinbefore discussed using standard formulations. The mode of 
delivery may be such as to have a binding inhibiting effective amount of the 
antibiofouling composition at a site potentially having, or having at least one living 
microbe. The antibiofouling compositions of the present invention are useful in treating 
microbial growth that contributes to biofouling, such as scum or slime formation, in 

20 these aqueous environments. Examples of industrial processes in which these 
compounds might be effective include cooling water systems, reverse osmosis 
membranes, pulp and paper systems, air washer systems and the food processing 
industry. The antibiofouling composition may be delivered in an amount and form 
effective for the prevention, removal or termination of microbes. 

25 The antibiofouling composition of the present invention generally 

comprise at least one compound of the invention. The composition may comprise from 
about 0.001% to about 50%, more preferably from about 0.003% to about 15%, most 
preferably from about 0.01% to about 5% by weight of the compound of the invention 
based on the weight percentage of the total composition. 

30 The amount of antibiofouling composition may be delivered in an amount 

of about 1 mg/1 to about 1000 mg/1, advantageously from about 2 mg/1 to about 500 
mg/1, and preferably from about 20 mg/1 to about 140 mg/1- 

Antibiofouling compositions for water treatment generally comprise 
transcription factor modulating compounds of the invention in amounts from about 

35 0.001% to about 50% by weight of the total composition. Other components in the 

antibiofouling compositions (used at 0.1% to 50%) may include, for example, 2-bromo- 
2-nitropropane-l,3-diol (BNPD), P-nitrostyrene (BNS), dodecylguanidine hydrochloride, 
2,2-dibromo-3-nitrilopropionamide (DBNPA), glutaraldehyde, isothiazolin, methylene 
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bis(thiocyanate), triazines, n-alkyl dimethylbenzylammonium chloride, trisodium 
phosphate-based, antimicrobials, tributyltin oxide, oxazolidines, tetrakis 
(hydroxymethyl)phosphoniuin sulfate (THPS), phenols, chromated copper arsenate, zinc 
or copper pyrithione, carbamates, sodium or calcium hypochlorite, sodium bromide, 
5 halohydantoins (Br, CI), or mixtures thereof. 

Other possible components in the compositions of the invention include 
biodispersants (about 0.1% to about 15% by weight of the total composition), water, 
glycols (about 20-30%) or Pluronic (at approximately 7% by weight of the total 
composition). The concentration of antibiofouling composition for continuous or semi- 

10 continuous use is about 5 to about 70 mg/1. 

Antibiofouling compositions for industrial water treatment may comprise 
compounds of the invention in amounts from about 0.001% to about 50% based on the 
weight of the total composition. The amount of compound of the invention in 
antibiofouling compositions for aqueous water treatment maybe adjusted depending on 

15 the particular environment. Shock dose ranges are generally about 20 to about 140 mg/1; 
the concentration for semi-continuous use is about 0.5X of these concentrations. 

The invention also pertains, at least in part, to a method of regulating 
biofilm development. The method includes administering a composition which contains 
a transcription factor modulating compound of the invention. The composition can also 

20 include other components which enhance the ability of the composition to degrade 
biofilms. 

The composition can be formulated as a cleaning product, e.g. 9 a 
household or an industrial cleaner to remove, prevent, inhibit, or modulate biofilm 
development. Advantageously, the biofilm is adversely affected by the administration of 

25 the compound of the invention, e.g., biofilm development is diminished. These 

compositions may include compounds such as disinfectants, soaps, detergents, as well as 
other surfactants. Examples of surfactants include, for example, sodium dodecyl sulfate; 
quaternary ammonium compounds; alkyl pyridinium iodides; TWEEN 80, TWEEN 85, 
TRITON X-100; BRU 56; biological surfactants; rhamnolipid, surfactin, visconsin, and 

30 sulfonates. The composition of the invention may be applied in known areas and 
surfaces where disinfection is required, including but not limited to drains, shower 
curtains, grout, toilets and flooring. A particular application is on hospital surfaces and 
medical instruments. The disinfectant of the invention may be useful as a disinfectant 
for bacteria such as, but not limited to, Pseudomonadaceae, Azatobacteraceae, 

3 5 Rhizabiaceae, Mthylococcaceae, Halobacteriaceae, Acetobacteraceae, Legionellaceae, 
Neisseriaceae, and other genera. 
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The invention also pertains to a method for cleaning and disinfecting 
contact lenses. The method includes contacting the contact lenses with a solution of at 
least one compound of the invention in an acceptable carrier. The invention also 
pertains to the solution comprising the compound, packaged with directions for using the 
5 solution to clean contact lenses. 

The invention also includes a method of treating medical indwelling 
devices. The method includes contacting at least one compound of the invention with a 
medical indwelling device, such as to prevent or substantially inhibit the formation of a 
biofilm. Examples of medical indwelling devices include catheters, orthopedic devices 
10 and implants. 

A dentifrice or mouthwash containing the compounds of the invention 
may be formulated by adding the compounds of the invention to dentifrice and 
mouthwash formulations, eg., as set forth in Remington's Pharmaceutical Sciences, 18th 
Ed., Mack Publishing Co., 1990, Chapter 109 (incorporated herein by reference in its 
15 entirety). The dentifrice may be formulated as a gel, paste, powder or slurry. The 
dentifrice may include binders, abrasives, flavoring agents, foaming agents and 
humectants. Mouthwash formulations are known in the art, and the compounds of the 
invention may be advantageously added to them. 

In one embodiment, the invention pertains to each of the transcription 
20 factor modulating compounds described herein in Tables 4, 5, 6, 7 and 8, and in 
Formulae (T)-(X). 

The contents of all references, patent applications and patents, cited 
throughout this application are hereby expressly incorporated by reference. Each 
reference disclosed herein is incorporated by reference herein in its entirety. Any patent 
25 application to which this application claims priority is also incorporated by reference 
herein in its entirety. 

The invention is further illustrated by the following examples, which 
should not be construed as further limiting. 

30 EXEMPLIFICATION OF THE INVENTION 

Example 1: Synthesis of Test Compounds 

The transcriptional modulating compounds described in this application 
can be synthesized by art recognized techniques or using the methods described herein. 

35 
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6-r2-ATniTio-phenvl)-3-thioxo-3.4-dihvdro-2H-ri.2.41tria7 iti.S-nna 




H 

This was prepared by a modified literature procedure (Doleschall, G.; 



Lempert,K. Tetrahedron 1973, 29, 639-649). Isatin(10g, 67.96 mmol) was dissolved 
5 in ca. 10% aqueous KOH (9.9 g in 100 mL of water) and then treated with 

thiosemicarbazide (6.28 g; 68.90 mmol). After 1 hour of heating at 1 15 °C (bath 
temperature), the reaction mixture was poured over ice and treated with glacial acetic 
acid drop-wise, till the pK was ca. 5. The yellow fluffy precipitate was filtered, washed 
copiously with water (8 x 50 mL) and dried first in air and then under high vacuum to 
10 afford 12.9 g of yellow solid. 

6-(2-Amino-phenvl)-3-butvlsulfanyl-2H-rL2,41triazin-5-one 



15 Lempert, K. Tetrahedron 1973, 29, 639-649). The product from the previous 

experiment (8.0 g, 36.3 mmol) was dissolved in ca. 10% aq. KOH (10.3 g in 100 mL of 
water) and treated with "Bui (7 mL). Ethanol (70 mL) was added to it and the reaction 
mixture was allowed to stir overnight. The reaction mixture was diluted with ether (100 
mL) and water (70 mL). The ether layer was separated and the aqueous layer washed 

20 further with ether (3x100 mL) and then poured over ice. Upon careful, drop-wise 
addition of glacial acetic acid with vigorous stirring at 0-4 °C, yellow precipitate was 
obtained which was filtered, washed with water (4 x 20 mL) and then with ether (2 x 
lOmL) and dried. Yield: 5.12 g. 



25 following the similar method. 

3-Methvlsulfanvl-6-f GV6 J-dihvdro-5-oxa- 1 .2 A7-tetraaza-dibenzo faxlcvcloheptene 




This was prepared by a modified literature procedure (Doleschall, G.; 



Other alkyl or substituted alkyl halides were used instead of n-butyliodide 
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This was prepared by a modified literature procedure (Doleschall, G.; 
Lempert, K. Tetrahedron 1973, 29, 639-649). To a suspension of compound 2 (or 
analogs of 2) (ca. 0.384 mmol, 1 equiv) in dry ethanol (3-4 mL), 25 \\L of glacial acetic 
acid was added, followed by ca. LI equiv of the corresponding aldehyde (G-CHO, 
5 where, G = substituted or unsubstituted aliphatic, aromatic, or heterocychc groups). The 
reaction mixture was refluxed for ca. 5-7 min resulting in a dark red - dark-reddish 
orange solution. Upon cooling to room temperature orange-orange-yellow solid crashed 
out of solution, which was filtered, washed with cold (ca. -30 °C) methanol (2x1 mL), 
and/or ether and dried. In some cases, the crude products were recrystallized from 
1 0 DMF/ether or methanol/ether; in most of the cases, the crude products, prepared as 
above, were >95% pure. Various ketones (GCOG') were reacted with 2 (or analogs of 
2) in a similar way to afford compounds of structural type 4. All the final compounds 
were characterized by means of l H NMR, LC-MS, HPLC (Qg columns, 
acetonitrile/water with 0.01% triethylamine as mobile phase), and CHN analyses. 

15 

General synthesis of orthoesters, GCfOR^: R = Me 




The syntheses of the desired orthoesters were accomplished by a modified 
literature procedure in multiple steps (McClelland, R. A. et al J. Org. Chetn. 1981, 46, 

20 1011-1012). Several novel orthoesters were prepared by this method. To a solution of 
an acid chloride in dichloromethane, iV-methylaniline was added slowly, followed by 
triethylamine and catalytic amount of 4-dimethylaminopyridine. After stirring it for ca. 
12 h, the reaction mixture was diluted with ether, the precipitate was filtered, washed 
with ether and dried. The amide, thus prepared, was then stirred overnight with methyl 

25 triflate in dichloromethane, diluted with ether, and the precipitate was filtered, washed, 
and dried to obtain an imidatonium triflate salt. This salt was dissolved in 
dichloromethane, cooled to 0 °C, and added slowly, with stirring, to a cold (0 °C) 
solution of sodium methoxide in dry methanol over a period of ca. 30-60 min. The 
solvent was evaporated to dryness and the residue was extracted in n-hexane. Upon 

30 evaporation of hexane, the white solid was obtained, which was dissolved in dry 
methanol and treated with glacial acetic acid. After 10 minutes of stirring, the excess 
acid was neutralized with potassium carbonate (solid), and the solvent removed under 
vacuum. The residue was extracted in ether, washed with water, and dried over 
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potassium carbonate. The crude material was obtained by evaporation of ether, and 
further purified either by flash chromatography or fractional distillation. 

3-Methvlsulfanvl-6-( GV5-oxa-l 2 A J-tetraazaHiibenzora.c1cvcloheptene 




5 * 

Compound of the type 2 (0.384 mmol, 1 equiv) was suspended in ethanol 

(2-3 mL), treated with glacial acetic acid (100 pL), followed by an orthoester (2 equiv) 

of the general formula G-C(OR)3, where G = substituted or unsubstituted aliphatic, 

aromatic, or heterocyclic group, R = H, substituted or unsubstituted aliphatic, aromatic, 

10 or heterocyclic group. The reaction mixture was refluxed for 70-1 80 minutes, cooled to 

room temperature. In some cases, the product crashed out of solution, in others, the 

crude reaction mixture was evaporated to dryness, re-dissolved in a minimum amount of 

methanol and diluted with ether. The solid was washed with ether (cold, 0-4 °C; lx 

lmL) and dried under vacuum. 

15 

4-Iodo-3-nitrothioanisole 




A flask was charged with 10 g of 4-iodothioanisole and 5.5 mL of 
dimethylsulfate and warmed to ca. 90 °C for 10 min. The resulting solution was 

20 dissolved in cone, sulfuric acid (30 mL) and cooled to ca. 0-4 °C, whereupon it was 

treated, with extreme caution, slowly with cone. HNO3 (ca. 2 mL) while maintaining the 
reaction temperature below 4 °C. After stirring it for ca. 10 min, the reaction mixture 
was stirred at ca. 90 °C for ca. 3 d. The reaction was monitored with HPLC, TLC, and 
LC-MS, and if needed, smaller portions of nitric acid were added to the reaction mixture 

25 to force it to completion. Use of fuming sulfuric acid is also helpful. After the complete 
consumption of the aromatic starting material, the reaction mixture was cooled, poured 
over crushed ice, treated with 30% aq. perchloric acid at 4 °C. The light colored 
precipitate was filtered, washed thoroughly with water, and dried under vacuum. The 
perchlorate salt was stirred with saturated aq. NaCl solution at 95 °C for 3-6 h. Upon 

30 cooling to room temperature, the precipitate was filtered, washed thoroughly with water 
to get rid of any inorganic salts, and dried under vacuum to obtain 4-iodo-3- 
nitrothioanisole in > 80% yield. 
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4-Iodo-3-aminothioanisole 



^sr NH 2 

Ca. 7 g of 4-iodo-3-iritiothioanisole was dissolved in absolute ethanol and 




5 treated slowly with a solution of SnCl2.2H20 in 12% aq. HQ. The reaction mixture was 
stirred at 50 °C for 25-30 min., when the HPLC monitoring indicated that the reaction 
was complete. The reaction mixture was poured over crushed ice, and treated with aq. 
NaOH solution to pH 8. The precipitate was filtered, washed with water and dried in air. 
The crude material was crystallized by overnight cooling (4 °C) of its ethanol (minimum 
10 amount) solution treated with 10% aq. HC1. The crystalline material was further dried 
under high vacuum to afford the desired amine as its hydrochloride salt in >70% yield. 

^Methvlsulfanvl-y-nitro-biphenvl^vlainine 



solution/suspension of 4-iodo-3-nitrothioanisole (1 mmol), Pd(OAc)2 (0.01 mmol) was 
purged with argon for -5 min. To this solution Et3N (3 mmol), and 5 mL of water were 
added and purged with argon for another 5 min. To the above solution, 2-aminophenyl 
boronic acid (2 mmol, solution in 5 mL of DMF, purged with argon), was added and the 
20 reaction mixture was heated at 70 °C (oil bath temperature) for 2 h. The reaction was 
monitored by HPLC and LC-MS to follow the product formation. The reaction mixture 
was then cooled down to room temperature and filtered through diatomaceous earth. 
The filtrate was concentrated and purified using preparative HPLC. 

25 4-Methvlsulfanvl'2 ! -amino-biphenyl-2-vlainine 




X-H.-OR.-SR 



15 



A methanol/dioxane (20 mL/5 mL of methanol/dioxane) 




X-H.-OR.-SR 



A flask was charged with ca. 1 mmol of 4-Methylsulfanyl-2'-nitro- 
biphenyl-2-ylamine, 15 mL of ethanol, and 0.1 mmol of Pt02, and stirred under 
hydrogen atmosphere at 40 psi for 10 minutes. The reaction mixture was filtered 
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through diatomaceous earth, washed with ethanol, and the combined organic layer was 
evaporated to dryness. The crude material was purified by preparative HPLC. The same 
material can also be prepared by the previous method, (Suzuki coupling conditions) 
starting from 4-iodo-3-aminothioanisole, and purified by preparative HPLC. 

5 

6-(GV3-methvlsulfanvl-5H-dibenzo[(Lfiri31diazepine 




X-H,-OR,-SR 

To a solution of the 2,2 , -biphenyldiamine (0.093 g; 0.51 mmol) in 
ethanol (2 mL), were added glacial acetic acid (50 |iL) and 2 equiv of an orthoester of 

1 0 the general formula GC(OR)3. In case of TFA salt of the diamine, there was no need of 
adding acetic acid to the reaction mixture. The reaction mixture was refluxed for 3h, 
cooled to room temperature, and evaporated to dryness. The residue was suspended in 
methanol saturated with dry HC1, stirred for a few minutes, filtered, washed with 
methanol, and finally with ether. The hydrochloride salt of the diazepine was dried 

15 under vacuum to afford a light yellow solid. In order to obtain a free base of the 
diazepine, the above hydrochloride salt was suspended in methanol, and treated with 
10% aq. NaOH solution. After stirring at room temperature for ca. 10 min, the 
precipitate was filtered, washed with water, and dried under vacuum. 

20 3-(6"Nitro-2'phenvl'benzoimidazol-l-vlVpropionitrile and 3-(5-Nitro-2-phenvl- 
benzoimidazol- 1 -vlVpropionitrile 




A mixture of 5-nitro-2-phenylbenzimidazole (lg, 4.2 mmol), acrylonitrile (50 mL) and 
N, N-dimethylpiridine (25 mg) was heated at 70 °C for 4 hr. The excess of acrylonitrile 
25 was evaporated and oily residue was subjected to the flash chromatography on silica gel 
using hexane-ethyl acetate (75: 25 v/v) as an eluent. Structure of the regioisomers was 
determined using *H NOESY studies. 0.25 g (20%) of the 6-nitro isomer and 0.23 g 
(18.9 %) of the 5-nitro isomer were obtained. 
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3-f6-Nitro-2-phenvl-benzoimidazol-l-vlVpropioiiic acid and 3-(5-Nitro-2-phenvl- 
benzoimidazol-l-vlVpropionic acid . 




To the 6-nitro nitrile (0.15 g, 0.51 mmol) concentrated HC1 (5 mL) was 
5 added and resulting mixture was heated at 50 °C for 0.5 hours. The acid was evaporated 
in vacuo and product was purified by HPLC. Yield 34 mg (21 %). An identical 
procedure was used starting from the 5-nitro nitrile yielding the product (22 mg, 13.8 %). 

Example 2: 

0 In this example, the expression of a selective marker (e.g., ccdB) is put 

under the direct control of a promoter activated by MarA (e.g. , inaA, galT, or micF). In 
the absence of MarA, the expression of the selective marker is silent and cells survive. 
Synthesis of MarA from an inducible plasmid in a bacterial or yeast cell leads to the 
activation of the inaA promoter, expression of ccdB, and subsequently results in cell 

5 death. Compounds that inhibit MarA are those that permit cell survival under conditions 
of MarA expression. The results of this assay are given in Table 4. In Table 4, * 
indicates good inhibition of MarA and ** indicates very good inhibition of MarA. 



Example 3: 

In this example, the expression of luciferase is put under the direct control 
of a promoter activated by MarA (e.g., inaA, galT, or micF) in a cell constitutively 
expressing MarA. In the absence of MarA, cells luminesce. Upon modulating of MarA 
activity, the expression of the reporter is altered. 

TABLE 4 



ID 


Structure 


Name 


Affinity 


A 




6-(5-Iodo-furan-2-yl)-3~methylsulfanyl- 
6,7-dihydro-5-oxa-l ,2,4,7-tetraaza- 
dibeuzo [a,c]cycloheptene 


** 


B 




6-(4-Ethoxy-phenyl>3-methylsulfanyl- 
6,7-dihydro-5-K)xa-l,2,4,7-tetraaza- 
dibenzo [a,c]cycloheptene 


* 


C 


.Ax 


3-Me%lsiilfanyl-6^5-nitro-furan-2-yl)- 
6,7-dihydro-5-oxa-l,2,4,7-tetraa2a- 
dibenzo foclcycloheptene 


* 
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3-Mefliylsulfenyl-6-[5-<4-nitro-phenyl)- 
furan-2-yl]-6,7-<iihydro-5-oxa- 1,2,4,7- 
tetraaza -dibenzo [a,c] cycloheptene 



4-(3-Ethylsulfanyl-6,7 -dihydro-5-oxa- 
1,2,4,7-tetraaza -dibenzo [a,c] 
cyclohepten-6-yl)-benzene-l ,2-diol 



6-(4-Benzyloxy-phenyl)-3-propylsulfanyl- 
6,7-dihydro-5-oxa-l ,2,4,7-tetraaza- 
dibenzo [a,c]cycloheptene 



6-Benzo[l ,3]dioxol-5-yl~3- 
methylsdfanyl-6,7-dihydn>-5-oxa-l,2,4,7- 
tetraaza-dibenzo [a,c]cycloheptene 



??? 



3-Butylsulfanyl-6-(2,4-dimethoxy- 
phenyl>6,7-dihydro-5-oxa-l,2,4,7- 
tetraaza -dibenzo[a,c] cycloheptene 



6-(4-AUyloxy-phenyl)-3-butylsulfanyl- 

6,7-dihydto-5-oxa.l,2,4,7-tetraaza- 

dibenzo[a,c]cycloheptene 



3-Butylsulfanyl-6-(4-ethoxy-phenyl)-6,7- 
dihydro-5-oxa-l ,2,4,7-tetraaza-dibenzo 
[a,c] cycloheptene 



K 




6-(4-Methoxy-pbenyl)"-3-piopylsulfanyl- 
6,7-dihydro-5-oxa-l,2,4,7-tetraaza- 
dibenzo [a,c]cycloheptene 




6-[5-(3.Nitro-phenyl)-furan-2-yl]-3- 
propylsulfanyl-6,7-dihydro-5-oxa-l,2,4,7- 
tetraaza -dibenzo[a,c] cycloheptene 




2-(3-PhenyHH-pyrazol-4-ylmethylene)- 
benzo[4 s 5] imidazo[2,l-b]thiazol-3-one 



2-[5-(3-Carboxy-phenyl)-furan-2- 
ylmethylene]-5-(2-methoxy-naphthalen-l - 
yi)-7-methyl-3-oxo-2,3-dihydro-5H- 
thiazolo[3^-a]pyrimidine-6-carboxylic 
acid ethyl ester 



5-(4-Dimethylamino-phenyi)-7-methyl-2- 
[5-(2-methyl-4-nitro-phenyl)-furan^2-yl 
methylene]-3-oxo-2,3-dihydro -5H- 
thiazolo[3,2-a] pyriimdine-6-carboxylic 
acid ethyl ester 



*** 



-108- 



WO 2004/041209 



PCT/US2003/035205 



p 




5-Benzo[l,3]dioxol-5-yl-7-methyl-2-[5- 
(2-methyM-mtro-plienyl)-ftrran-2-yl 
methylene]-3-oxo-2,3-dihydro -5H- 
thiazolo[3,2-a] pyrirnidine-6-carboxylic 
acid ethyl ester 


* 


Q 




S-Y^ 4.-T>tmptVinYV-TiVipnv1V.7-T»if»tiiv1-'?-r^- 

J ^ a *T"J^UilwULl\JAjr "pilClljrl/" / Hit' UAJ' 1""^ I*' 

(2-methyl-4-nitro -phenyl)-furan-2-yl 
methyleae]-3-oxo-2,3-(Mhydro-5H- 
tbiazolo[3^-a]pyrimidine-6-carboxylic 
acid ethyl ester 


*** 


R 




/ -Metnyi-z-[D^z-inetByi-4-rutro~pneiiyi y 
furan-2-ylmethylene]-5-(4-methyl 
sulfanyl-phenyl)-3-oxo-2,3-dihydro-5H- 
thiazolo[3,2-a] pyrirnidine-6-carboxylic 
acid ethyl ester 


*** 


S 


vCO 

HO O 


2-[5-(4-Carboxy'phenyl)-furan-2- 
yImethylene]-5-(2-methoxy-naphthalen-l- 
yl)-7-methyl-3-oxo-2,3-dihydxo-5H- 
thiazolo[3,2-a]pyrimidine-6-carboxylic 
acid ethyl ester 


*** 


T 




5-Benzo[l,3]dioxol-5-yl-2-[5<4- 
ethoxycarbonyl-phenyl)-furan-2- 
ylmethylene]-7-methyl-3-oxo-2,3- 
dihydro-5H-thiazolo[3,2-a]pyririiidine-6- 
carboxylic acid ethyl ester 


*** 


U 




7-M*ethyl-3-oxo-5-phenyl-2-[5-{3- 
trifluoromethyl-phenyl)-fiiran-2- 
ylmetiiylene]-2,3-dihydro-5H- 
tMazolo[3,2-a]pyriraidine-6-carboxylic 
acid ethyl ester 


** 


V 


9 


7-Methyl-2-[5-(2-rnethyl-4-nitro-phenyl)- 
furan-2-yl methylene]-3-oxo-5-phenyl- 
2,3-dihydro-5H-thiazolo[3,2- 
a]pyrinudine-6-carboxylic acid ethyl ester 


*** 


W 


0 ^"N HO 


2-[5-{3-Carboxy-phenyl)-ruran-2- 
ylmethylene]-5^4^iime&ylarriirio- 
phenyl)-7-methyl-3-oxo-2,3-dihydro-5H- 
thiazolo[3,2-a]pyrimidine-6-carboxylic 
acid ethyl ester 


He* 


Y 

-rv 


Y 


5^4-Dimethylainino-phenyl)-7-methyl-2- 
[5<4-methylO-nitro-phenyl>ruran-2-yl 
methylene]-3-oxo-2 3 3-dihydro-5H- 
thiazolo[3,2-a]pyrimidine-6-carboxylic 
acid ethyl ester 


** 



-109- 



WO 2004/041209 



PCT/US2003/035205 



Y 


-I 6 


2-[5-(3-Carboxy-phenyl)-fkran-2- 
ylmetnylenej- / -metnylo -(4- 1 
methylsulf anyl-phenyl)-3 -oxo-2,3 - 
dihydro-5H-Mazolo[3,2^]pyrimidine-6- 
carboxylic acid ethyl ester 


*** 


Z 




[l,2]Naphthoquinone l-[0-(6-oxo-6H- 
anthra[l,9-cd] isoxazol-5-yl)-oxime] 


* 


AA 




3-Acetyl-2,5,7-triphenyl-lH- 1 ,3a,4,8- 
tetraaza-7a-azonia-cyclopenta[a]indene 


*** 








AB 




1 -Amino-3 -benzo [ 1 ,3]dioxol-5-yl- 
benzo[4,5]imidazo[l,2-a] pyridine-2,4- 
dicarbonitrile 


** 


AC 




2-[2-(5-Furan-2-yl-4-phenyl-4H- 
[l,2,4]triazol-3-yl sulfanyl)-acetylamino]- 
benzoic acid methyl ester 


* 


AD 




6,7-Dime%l-2-(3-phenyl- lH-pyrazol-4- 
ylmethylene)-benzo[4,5]imidazo[2, 1 -b] 
thiazol-3-one 


* 


AE 




2-(5-Benzo[l,2,5]oxadiazol-5-yl-4- 
methyl-4H-[l,2,4] triazol-3-ylsulfanyl)-N- 
(3-methykidfimyl-phenyl)-acetamide 


* 


AF 




4-(l ,3-Dioxo-indan-2-ylidene)-2-phenyl- 
6-pyridin-2-yl-tetrahydro-pyrrolo[3,4-c] 
pyrrole- 1,3-dione 


** 


AG 




6-Nitro-2-phenyl-l-(3-.trifluoromethyI- 
benzyloxy)- lH-benzoimidazole 


** 


AH 




(6-Nitro-2-phenyl-benzoimidazol- 1 - 
yioxy)-acetic acid 


** 


AI 




1 -Benzyloxy-6-nitro-2-phenyl-lH- 
benzoimidazole 


** 


AJ 




1 -(4-Methyl-benzyloxy)-6-nitro-2-phenyl- 
lH-benzoimidazole 


* 



-110- 



WO 2004/041209 



PCT/US2003/035205 



AK 


3 




6,8-Dimethyl-2-(4-nitro-phenyl>5- 

phenyl-5H,6H-l-oxa-3,5,6,8-tetraaza- 

cyclopenta[a]naphthalene-4,7,9-trione 


** 


AL 




6,8-Dimethyl-5-phenyl-2-p-tolyl-5H,6H- 

l-oxa-3,5,6,8-tetraaza-cyclopenta 

[a]naphthalene-4,7,9-trione 


* 


AM 




2-[3-{4-Fluoro-plienyl)-l-phenyl-lH- 
pyrazol-4-yl methylene]-benzo [4,5] 

iTTnaaZQlZ, 1 -DJ unazoio-uiie 


** 








AN 






Cobalt 5,10,15,20-Tetra-pyridin-4-yl- 
porphyxine 


*** 


AO 




2- [3-(4-Fluoro-phenyl)- 1 -phenyl- 1 H- 
pyrazol-4-ylmethylene]-5-methyl-6-vinyl- 
inridazo[2, 1 -b]thiazol-3-one 


** 


AP 






Cobalt 5,10,15^0-Tetra-pyridin-3-yl- 
porphyrine 


*** 


AQ 






Zinc 5,10,15,20-Tetra-pyridin-4-yl- 
poiphyrine 


*** 


AR 




2^4-isopropylphenyl)-4H-chromen-4-one 


*** 


AS 




2-(3,4-Dihydroxy-phenyl)-3,5,7- 
trikydroxy-chromen-4-one (luteolin) 


*** 


AT 




N-isopropyl-2-[(4-raethyl-5-quinolin-6-yl- 
4H-l,2,4-triazol-3-yl)thio]acetamide 


*** 


AU 


As 


4-hydroxy-6-methyl-5,6-dihydro-2H- 
pyrano[3 £-c]quinolinfi-2,5-dionfi 


*** 
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AV 




5,7-Dihydroxy-2-(4-hydroxy-pheiiyl> 
chromen-4-one 


*** 


AW 




2-[4-(dimethylamino)phenyl]-4H- 
chromen-4-one 


** 


AX 




l-(benzyloxy)-2-phenyl-lH-imidazo[4,5- 
b]pyridine 


** 


AY 




2-(benzylthio)-4-phenyl-5 -( 1 -phenyl- 1 H- 
1 ^.S^-tetraazol-S-ylJpyrinudine 


** 


AZ 




6-fhioro-2-phenyl-4H-chromen-4-one 


** 


BA 




7-methoxy-2-phenyl-4H-chromen-4-one 


* 


BB 




4-(l,3-dioxo-l,3-dihydro-2H-inden-2- 
yliden)-2-phenyl-6-(2-pyridinyl) 
tetrahydropyrrole [3,4-c]pyrrole- 
l,3(2H,3aH)-dione 


* 


BC 




2-(2-Hydroxy-3-oxo-5-p-tolyl-2,3- 
dihydro-furan-2-yI)-malonamic acid ethyl 
ester 


* 


BD 


o 
II 


2-[(6-nitro-2-phenyl- 1 H- 1 , 3 - 
benzimidazol- l-yl)oxy]acetic acid 




BE 




2-(4-fluorophenyl)-4H-chromen-4-one 


* 
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BF ' 




1 -methoxy-2-(4-methyl phenyl)-lH- 1 
imidazo [4,5-b] pyridine 




BG 




2-(4-hydroxyphenyl)-4H-chromen-4-one 


* 



Example 4: 

In this example, the expression of a selective marker (e.g, ccdB) is put 
under the direct control of a promoter repressed by MarA (e.g. 9 fecA,purA 9 guaB). 
5 Under conditions of constitutive MarA synthesis {e.g., using a constitutive mar (mar°) 
mutant), the expression of ccdB is silent. Following inactivation of MarA, the synthesis 
of ccdB results in cell death. 

Example 5: 

10 In this example, the expression of a selective marker (e.g., URA3) is put 

under the direct control of a promoter repressed by MarA (e.g.,fecA,purA, guaB). 
Under conditions of constitutive MarA synthesis (e.g., using a constitutive mar (mar c ) 
mutant), the expression of URA3 is silent. Following inactivation of MarA, and in the 
presence of 5-FOA the synthesis of URA3 results in cell death. 

15 

Example 6: 

In this example, a purine or guanine heterotroph is constructed by 
inactivation of either of the chromosomal guaB or pur A genes in E. coli. The guaB or 
pur A gene is then placed into a suitable vector under the control of its natural promoter 
20 and transformed into the heterotrophic host. 



25 



Example 7: E. coli Biofilm Assay 

The biofilm assay screens test compounds for their ability to inhibit 
bacteria from forming a biofilm. 

Materials: 

The M9 media ("M9") contained M9, casamino acids , and glucose. The 
test compound was dissolved in lOmg/mL DMSO stock solution. 



30 
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Method: 

Preparation of Inoculum 

Inoculum was started the day of the experiment by adding a colony or 
5 glycerol stock stab to 2mL M9. The tube was placed in the 37 °C shaker incubator for 
approximately 4-6 hours. This inoculum was referred to as the "Starter inoculum/* The 
inoculum was then removed from the shaker incubator and diluted to 1 X 10 6 cells/mL in 
M9. 

10 Preparation of Controls 

Typically, there were eight of each control, including a positive and 
negative control. For both the positive and negative controls, 2.5 fiL of DMSO was 
added to 200 fiL of M9. 25 /zL of the diluted DMSO was added to 50 /iL of M9 in the 
assay plate. 

15 

Preparation of Test Compounds 

The test compounds were screened at 20 /ig/mL. 2.5 fiL of the test 
compound were taken from a plate containing lOmg/mL stock and added to 200/iL of 
M9 and mixed. 25 fiL of the diluted test compound was added to 50 fiL of M9 in the 
20 assay plate. The resulting concentration of the test compound was 40 ptg/mL 

Preparation of Plate 

75 fiL of the inoculum at 1 X 10 6 cells/mL was added to each well 
containing compound and the positive controls. 75 fiL M9 was added to the negative 
25 controls. The final concentration of the test compound was 20 /ig/mL and the final 
concentration of the inoculum was 2 X 10 5 cells/mL. The plates were then placed in a 
microplate reader (Wallac VictorV) and read OD535 ("Initial growth reading"). The 
plates were then placed in an incubator overnight at 35 °C. In the morning, the plates 
were read in a microplate reader at OD535 ('Tinal growth reading.") 

30 

Addition of Crystal Violet 

The inoculum was then removed from the wells and the plates were 
washed several times with tap water. 1 50 /xL of Crystal Violet (0.02% Crystal Violet 
dissolved in water) was then added to each well. 

35 

Addition of Ethanol 

The crystal violet was then removed and the plates were washed several 
times with tap water. 1 50 /iL of ethanol were then added to each well, after mixing. The 
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plates were then placed in a microplate reader and read the OD535. This was referred to 
as the "Crystal Violet" reading. 

Data Analysis 

5 To determine whether a test compound inhibited growth, the Initial 

growth reading was subtracted from the Final growth reading ("Subtracted Growth").The 
same was done for the positive controls and averaged. The % inhibition of growth was 
determined by the following formula: 

10 100-(100*Subtracted growth of sample/Average growth of Pos Controls) 

To determine whether a test compound inhibited Biofilm formation, the 
%Inhibition of Biofilm Formation was determined using the following formula: 

15 100-(100*Crystal Violet read of sample/Average crystal violet read of Pos Controls) 

The results from the Crystal Violet assay are summarized in Table 5. In 
Table 5, ND indicates that a given compound did not inhibit biofilm formation in the CV 
assay. * indicates that the test compound inhibited some biofilm formation and ** 
20 indicates that the compound inhibited the formation of a biofilm well. 

Example 8: LANCE Screening Assay for Inhibitors of MarA, SoxS, or Rob DNA- 
binding 

This example describes a method for the identification of test compounds 
25 that inhibit the interactions of purified transcription factor such as MarA, SoxS and/or 
Rob with a target DNA sequence in an in vitro system. Such molecules will hopefully 
be able to inhibit this interaction in vivo, leading to inhibition of transcriptional 
regulation by these factors and ultimately in inhibition of the drug resistance and/or 
virulence phenotypes associated with MarA, SoxS and Rob. 

30 

Materials 

The 6His-tagged MarA, SoxS and Rob purified according to respective 
protocol. The N-term-biotinylated double-stranded DNA has a sequence of CCG ATT 
TAG CAA AAC GTG GCA TCG GTC (SEQ ID NO. 5). The antibody used was the 
35 LANCE Eu-labeled anti-6xHis Antibody (Eu-aHis) (Perkin Elmer cat # AD01 10) which 
had at least 6 Europium molecules per antibody. Streptavidin conjugated to 
SureLight™-Allophycocyanin (S A-APC) was obtained from Perkin Elmer (cat # 
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CR130-100). The Assay buffer contained 20mM Hepes pH 7.6, ImM EDTA, lOmM 
(NH4) 2 S0 4 , and 30mM KC1, and 0.2% Tween-20. 

Method 

5 The plates or vials of the compounds to be tested were thawed. These 

stocks were at a concentration of lOmg/ml in DMSO. The solutions were allowed to 
thaw completely, and the plates were briefly shaken on the Titermix to redissolve any 
precipitated compound. Thawed aliquots of MarA, SoxS and Rob protein from the stock 
stored at -80°C and 1M stock of dithiothreitol stored at -20 e C were then placed on ice. 

1 0 Dilutions at 1 : 1 00 of the compounds were made into a fresh 96-well 

polystyrene plate. The dilutions were prepared with 1 00% DMSO to give a final 
concentration of 1 00 ng/ml solutions. The dilutions were vortexed on a Titermix. 

Fresh DTT was added to 25-50 mL of assay buffer to produce a ImM 
final concentration. Next, 90 pi of assay buffer was added to each of the 10 \il protein 

15 aliquots, and the solution was mixed by pipetting. These proteins were diluted to give 
the required amount of each of the diluted proteins, resulting in 20 nl of diluted protein 
per well. In preparing the solutions, 20% excess was made to allow enough for control 
wells. Typically, depending on the protein preps and the initial binding curves that were 
performed, 1000-2000 finoles of each protein was required per well. The diluted protein 

20 solutions were the placed on ice. 

Three tests plates per plate of compound (for MarA, SoxS and Rob) were 
prepared. Using a multichannel pipet, 5|xl of the compound was added to each well. 5^1 
of DMSO was added to the blank and control wells, and 5\i\ of the control inhibitor was 
added to the respective wells. 

25 Using the multichannel pipet, 20^1 of protein was added to all wells 

except those designated "blank". To these blank wells, 20^1 of assay buffer was added. 
The plates were covered with a plate sealer and incubated at room temperature, shaking 
on the Titermix, for 30minutes. 

Next, the DNA solution was prepared, with enough for at least 20% more 

30 wells than were tested. 15^1 (0.4 finoles) was added per well. Then the DNA was 

diluted in assay buffer, and vortexed briefly to mix. The plate sealer was removed, and 
15 Hi of DNA solution was added to all of the wells, the plates were then resealed, and 
returned to the Titermix for a further 30 minutes. 

After 25 minutes, the antibody solution was prepared. 0.4 finoles of SA- 

35 APC and 0.125 finoles of Eu-oHis were added per well in a total volume of IOjaI. 
Amounts were prepared sufficient for at least 20% excess. The plate sealer was the 
removed and 10 |il of antibody solution was added to every well. Thfe plates were 
subsequently resealed, placed on the Titermix, and covered with aluminum foil. The 
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plates were mixed for 1 hour. The plates were then read on the Wallac Victor V, using 
the LANCE 61 5/665 protocol. 

Data processing 

5 For each plate, the mean control (i.e. signal from protein and DNA 

without inhibition), mean blank (background signal without protein) and mean inhibitor 
(P001407) LANCE 6 65 counts were determined. The percentage inhibition by each 
molecule (each test well) was then determined according to the following equation: 

10 % Inhibition == 100-(((test-meanblank)/(mean control-mean blank)*100) 

Compounds that gave 40% or greater inhibition were identified as hits 
and screened again for IC50. 

15 ICso screening 

The protocol used was identical to that outlined above, except that only 
10 compounds were assayed per plate. The testing concentrations started at 10 ^ig/ml 
and were diluted two-fold from 10 to 0.078^g/ml. 

20 ICso Data processing 

Percent inhibition was calculated as shown above. Percent inhibition was 
then plotted vs. log (cone. Inhibitor) using Graph pad Prism software. The IC50 
concentration was determined as the concentration that gives 50% inhibition. 

The data from this example is also summarized in Table 5. *** indicates 
25 that a particular test compound inhibited the particular bacteria very well, ** indicates 
that the particular test compound inhibited the particular bacteria well, and * indicates 
that the particular test compound inhibited the particular bacteria to some extent. 

Example 9: Luciferase Assay 
30 The luciferase assay is used to determine if any of the compounds tested 

reduce the luminescent signal. This indicates that the test compounds affect regulation 
of micF, which in turn is regulated by Mar. 

Materials 

35 The bacteria used were E.coli AG1 1 2KmicF-Luc. The negative control 

Bacteria were E.coli AG1 12. The test compounds were prepared in a lOmg/mL DMSO 
stock solution. 
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Methods 

Preparation of Inoculum 

Inoculum (or "Starter Inoculum") was started the night before the day of 
the experiment by adding either a colony or stab of a glycerol stock to 2mL of LB Broth. 
5 The Starter Inoculum was then placed in a 37 °C shaker incubator and allowed to grow 
overnight. 

The following day, the Starter Inoculum was removed from the shaker 
and added to fresh LB Broth. For each plate to be assayed, 6mL of LB broth was 
prepared, with 5-10fiL of Starter Inoculum being added per mL of added LB to form the 
10 'Test Inoculum". Typically, four plates of test compounds were assayed In this typical 
example, 6mL of LB Broth was used for each plate, or 24 mL of LB, followed by the 
addition of 5|iL/mL of Starter Inoculum, or 120jjL of Starter Inoculum to form the Test 
Inoculum. 

Following preparation of the Test Inoculum, the Test Inoculum was 
15 placed in a 37 degree Celsius shaker and incubated for about 4 hours. The Test 

Inoculum was monitored for bacterial growth by taking OD readings at 535 nm on a 
spectrophotometer. The Test inoculum should be removed when the OD reaches 
between 0.6 and L5. 



20 Preparation of Controls 

Positive and negative controls were created by adding 2uL DMSO to 
198uL LB Broth. At least 4 of each control were generated. Typically, there were 8 of 
each. 50uL of diluted DMSO was added to 50uL LB Broth in the assay plates. 

25 Preparation of Compounds 

The compounds were screened at 25ug/mL. Two identical plates of each 
compound were set up: 1 clear plate for growth (or "Clear Plate"), 1 white plate for 
luminescence (or "White Plate"). Next, 2 \£L of each compound was taken from the 
daughter plate (containing lOmg/mL stock) and added to 198 /aL of LB Broth. The 

30 sample was then mixed. Next, 25 \£L of the diluted test compound was added to 25 jzL 
of LB Broth in all of the assay plates. The concentration of the compound at this stage 
was 50 /xg/mL. 

Preparation of Plate 

35 50 \£L of the Test Inoculum was added to each well of the plates, except 

for the negative controls. Half of the negative controls received 50 fiL of AG1 12, while 
the other half of the negative controls received 50 fiL LB Broth. The final concentration 
of the test compound was 25 /xg/mL. 
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The Clear Plates were placed in the plate reader and read at OD535. This 
was the 'Initial" growth read. The plates were then incubated plates for 5 hours at 37 
degrees Celsius. After 5 hours, the plates were removed from the incubator. The Clear 
Plates were placed in the plate reader and read at OD535. This was the "Final" growth 
5 read. 1 
100 /iL of Promega Steady-Glo reagent was added to each well (including all 
controls) in the White Plates. The plates, covered with aluminum foil, were then shaken 
on a plate shaker set at 10000 rpm for 10 min. The plates were then placed in plate , 
reader and read on luminescence for Isec per well. This was the LUMINESCENT read. j 

10 

Data Analysis 

To determine whether the test compound inhibited growth, the Initial growth read 
was subtracted from the Final growth read. This was the Subtracted Growth. The same 
calculation was performed for the positive controls. The results for the positive controls 1 
15 were averaged. The %Inhibition of Growth was determined using the following , 
formula: 

100-(100*Subtracted growth of sample/Average growth of Pos Controls) j 

20 To determine whether compound inhibits Luciferase, use the following equation: j 

I 

10G-(100*Luminescence for Compound/Average Luminescence of Pos Controls) 

ND indicates that a particular test compound did not appear to decrease 
25 the lumninesce of in this particular assay. * indicates that the luminescence was 

decreased somewhat and ** indicates that the luminescence was decreased a substantial 
amount. The results from this assay are also shown in Table 5. 

i 

I 
I 
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Example 10: Synthesis of Various Benzimidazole Compounds 



Synthesis of (6-Nitro-benzoirnidazol-l-vloxv)-acetic acid (6) 




(2,4-Dinitro -phenylamin oVacetic acid methvl ester (2\ A mixture of 1- 
fluoro-2,4-dinitrobenzene (1) (15g, 0.81 mmol), glycine methyl ester hydrochloride (11.5 
g, 0.92 mmol), K 2 C0 3 ( 22.3 g, 0.162 mmol) and methanol (300 mL) was heated at 60 
°C for 30 minutes. After cooling in an ice bath, the resulting yellow precipitate was 
10 collected by filtration, washed with water and methanol and dried in vacuo. Yield 1 0.5 g 
( 51%). 

l-Hvdroxv-6-mtro-lH-ben2oimidazole-2-carboxvhc acid ethvl ester (3V 
A solution of (2,4-Dinitro-phenylamino)-acetic acid methyl ester (2) (3g, 1 1.8 mmol) in 
ethanol (100 mL) was heated to 70 °C. After addition of 2.4 mL (24.2 mmol) of 

1 5 piperidine, the solution was refluxed at 70 °C. After 2 hours, the solvent was removed 
in vacuo and the resulting residue was dissolved in water (100 mL). Acidification of the 
solution with HC1 yielded a yellow precipitate, which was collected by filtration, washed 
with water and ethanol and dried in vacuo. Yield 1.9 g, (63%) of yellow solid. 

6-Nifro-ben7.niTnirifl?rti. i.oi hydrochloride (4) . A mixture of l-hydroxy-6- 

20 mtro-lH-benzoimidazole-2-carboxylic acid ethyl ester (3) (5g, 20 mmol ) and 

concentrated HC1 (100 mL) was refluxed for 3 hours. After cooling the mixture to room 
temperature, the resulting solid was collected by filtration. Yield 1 .9 g (44 %) of the HC1 
salt. 
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6-Nitro-benzoimidazol-l-vloxvyacetic acid ethyl ester (5) To a mixture 
of 6-nitro-benzoimidazol-l-ol hydrochloride (4) (2 g, 9.3 mmol) and K2CO3 (2.56 g, 19 
mmol) 

in DMF (60 mL) was added ethyl bromoacetate (3. 1 g, 19 mmol) with stirring at room 
5 temperature. After 4 hours, the reaction mixture was poured into water. The resulting 
solid was collected by filtration, washed with water and ethanol and dried in vacuo. 
Yielding (49%). 

(6-Nitro-benzoimidazol-l-yloxyy acetic acid (6) A mixture of 6-nitro- 
benzoimidazol-l-yloxy)-acetic acid ethyl ester (5) (250 mg, 0.94 mmol ), THF (5 mL), 
10 water (ImL) and concentrated HC1 (1 mL) was heated to reflux for 2 hours. The reaction 
mixture was evaporated and the crude residue was purified by HPLC (21.2 x 250 mm 
Phenomenex Luna C18(2) column; flow rate = 20 mL/min; linear gradient 0-100% B 
over 30 minutes; A Buffer = water with 0.1% TFA, B Buffer = acetonitrile with 0.1% 
TFA). HPLC solvents removed in vacuo to yield yellow solid. Yield 65 mg (29 %). 

15 

Synthesis of 2-arvl-benzimidazoles 




Ri = N0 2 , F, CI, Br, NH 2 , NHAc, COMe, COPh, CF 3 , COOH, OMe, CN, 
CONH2, 'Bu, COOR, etc. 
20 R 2 = substituted or unsubstituted phenyl, substituted or unsubstituted heterocycle 

(5 or 6 membered rings etc.) 

To a solution of Ri-substituted-2-nitrofluorobenzene in DMF or DMSO, 
was added 2 equiv of NaHC03. Ca. 1.5-2 equivalents of the corresponding substituted 

25 benzylamine (e.g., R 2 = -H, -Me, -NH 2 , -CI, -OMe, -C(=NH)NH 2 etc.) was added slowly 
to the reaction mixture with vigorous stirring. The reaction was monitored by 
HPLC/TLC and upon complete consumption of the starting material, the reaction 
mixture was poured into ice water and the precipitate was filtered, washed with excess 
water and air-dried. La some cases, upon pouring the reaction mixture in water, 

30 treatment with 10% dilute HC1 (aq.) was needed to wash away excess salts and any base. 
The material thus obtained is usually pure, and can be used for the next step without any 
further purification. Yields are between 50-95%. Purity of each of the compounds was 
confirmed using l H NMR spectroscopy, HPLC, and MS. 
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OH 




Ri = N0 2 , F, CI, Br, NH 2 , NHAc, COMe, COPh, CF 3 , COOH, OMe, CN, 
CONH2, 'Bu, COOR, etc. 
5 R2 = substituted or unsubstituted phenyl, substituted or unsubstituted heterocycle 

(5 or 6 membered rings etc.) 



A solution/suspension of the substituted nitro compound (from the 
previous step) in methanol or THF or methanol/DMF was warmed to 50 °C, treated with 

10 an excess base (NaH, CH 3 0"Na + , aq. NaOH, etc.) and stirred. HPLC monitoring of the 
reaction mixture indicated the completion (10 min - 12 h depending upon the 
substituent) of the reaction. The reaction mixture was poured over ice, treated with dil. 
HC1 to acidic pR, and the resulting precipitate was filtered, washed thoroughly with dil 
HC1, and finally with water. In cases, where the product was water soluble, the reaction 

15 mixture was quenched with ice-water, evaporated to dryness, and the product was 
purified via extraction, washing, or if necessary, via chromatography. All the 
compounds were characterized using HPLC, MS, and l H NMR spectroscopy. Yields: 30 
-90% 



COOH 



R1 

20 




R t - N0 2 , F, CI, Br, NH 2 , NHAc, COMe, COPh, CF 3 , COOH, OMe, CN, 

CONH2, 'Bu, COOR, etc. 
R 2 = substituted or unsubstituted phenyl, substituted or unsubstituted heterocycle 

(5 or 6 membered rings etc.) 

25 

To a DMF solution of the benzimidazole-N-hydroxide (from the previous 
step) 1.5-2 equiv of anhydrous Na 2 C03 was added, followed by the addition of 1.2-1.8 
equiv of bromo-acetic acid. The reaction mixture was stirred at room temperature and 
monitored via HPLC/MS. Upon completion of the reaction, the reaction mixture was 
30 poured over ice and treated with dil. HC1 to an acidic pil. In most cases the product 
crashed out of the solution, which was filtered, washed thoroughly with dil. HC1, and 
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water and air dried. The product thus obtained is usually pure, but when needed, it could 
be recrystallized from DMF/ether or methanol/ether or dichloromethane/hexane solvent 
systems. In cases, where the product is water soluble, the quenched reaction mixture 
was concentrated to a volume where the product started to crash out. The final product 
5 was purified via chromatography in such cases or in cases that the crude material is 
impure. The final product was characterized using HPLC, MS, ] H NMR spectroscopy, 
and in some representative cases, using CHN analyses and 13 C NMR spectroscopy as 
well. 

1 0 Example 1 1 : SoxS Gel Shift Assay of Test Compounds 

The test compounds were diluted in DMSO to the required concentration 
and added to the appropriate wells. Protein (SoxS) was added to the wells in EMSA 
buffer at a concentration that was determined to cause a 50% shift of the DNA. The 
plates were then covered, mixed and shaked for 30 minutes at room temperature to allow 

1 5 for compound-protein binding. 

Ten jil of DNA mix (2.4 jil 5x EMSA buffer, 0.2 \il poly(dIdC), 1 |U 33 P- 
DNA probe, 7.4 fil dH 2 0 per reaction) was then added to each well. The final DNA 
concentrations were approximately InM. The samples were then mixed for 15 minutes 
at room temperature which allowed protein-DNA complexes to form. 

20 Electrophoresis was started at approximately 1 10V and the gels were pre- 

run for 10-15 minutes. Five ^1 of gel loading buffer was then added to each sample and 
mixed. Fifteen \il of each sample was then loaded onto gel. The gel was 
electrophoresed at 1 1 0V for approximately 2 hours or until the bromophenol blue marker 
approached the bottom of the gel. The gel was then transfered to Whatman filter paper, 

25 covered, and dried at 80°C for approximately 30 minutes. Autoradiography film was 
exposed to the gel overnight and developed. 

The probe alone well showed a single DNA species (unbound) of an 
apparent low molecular weight. Controls containing protein showed approximately 50% 
of the DNA at a shifted or bound position (apparent higher mwt) and 50% at the same 

30 position as the probe alone (free DNA). Samples containing test compounds showed 
ratios of bands between these two controls. A compound that completely inhibited 
protein-DNA binding appeared to be similar to that of the probe alone. Table 6 shows 
the results of this assay. Compounds which showed superior inihibition of DNA binding 
are indicated by "***," compounds which showed very good or good inhibition of DNA 

35 binding are indicated by "**" or respectively. Compounds which showed some 
inhibition of DNA binding are indicated by 
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Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no 
5 more than routine experimentation, numerous equivalents to the specific polypeptides, 
nucleic acids, methods, assays and reagents described herein. Such equivalents are 
considered to be within the scope of this invention and are covered by the following 
claims. 
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TABLE 5 



ID 


i — aH^HffHpnnai 


BH 





Bi 


■ 

CCX-Sj ****** NT " NT ** ** ND 


BJ 


V r*s 1 

dv^x? ' - - " 


BK 


C^^-O • . - » - - - ND NO 






BL 




BN 








BO 




BP 


YryVp * - - NT • kt *• • - 


BQ 


><> AVSr^ ~ . . * ND - 

1 


BR 


* HO ND 

o 


BS 


KT • NT • NT • " " ~ 






BT 




BU 


}~^y~^ ' • - NT • NT " NO ND 


BV 


Q-^Q-J . • - NT ' NT ~ " - 


BW 


•f 0 ' m ND " 


BX 


/7\A7 • • • • NT - NT - ND ND 


BY 


^0^^ - •** •* ND *" 


BZ 


Q^y-^j^^^ NDND 


CA 


gT^-, • • NT - *» - 
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ID 


STRUCTURE 




CP 




- 


1 


1 


1 


1 


1 


1 




1 
























CQ 


KT 


















CR 




NT 


• 


NT 


- 


NT 




- 


- 


NO 


CS 




NT 




NT 




NT 


• 


- 


- 


NO 


CT 




- 




















9 




NT 












NO 


• 


CU 


S 9 


NT 


NT 












ND 


ND 


CV 




NT 


• 


• 


NT 




NT 




ND 


NO 


cw 






• 


- 


• 








NO 


- 


cx 


o 
















ND 




CY 
























u 
















NO 


ND 
























CZ 
















NO 


ND 
























DA 


O 
















NO 


ND 
























DB 


















ND 


NO 




oy^x 




















DC 
















ND 




DD 








































ND 





-127- 



WO 2004/041209 



PCT/US2003/035205 



TABLE 5 



ID 
DE 


STRUCTURE 
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fi5 

1 


1st 


5=> 

1 


m 


i 


a 

I 


a 

I 


























DF 


















NO 


























DG 










































DH 




















ND 
























Dl 






















DJ 




• 


- 


• 


NT 


- 


NT 


- 


ND 


ND 


DK 










NT 




NT 




ND 


NO 


DL 


















ND 




DM 








Ml 


• 




• 


M 


NO 


ND 


ON 


















ND 


ND 
























DO 
















ND 


• 


DP 
















- 


ND 


ND 
























DO 


















NO 


ND 


DR 


















ND 


ND 


DS 


















NO 




DT 


















NO 
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DU 


STRUCTURE fj 








i 


i 


C 

1 


B 


□ 

1 


DV 


































NO 




DW 




— - 


- 


• 


- 


- 


- 


NO 


- 






















DX 
















NO 


- 






















DY 
















NO 


- 






















DZ 


















EA 




* • 


• 


NT 


• 


NT 


- 


• 


• 


EB 






- 


NT 


- 


NT 


- 


• 


- 


EC 








NT 




NT 






NO 


ED 




T — 


NT 




NT 










EF 


^ - 


T " 


NT 


- 


NT 




- 


• 


- 


EG 




T * 


NT 


- 


NT 


• 








EH 




r — 


NT 




NT 










El 








NT 




NT 




NO 


NO 


EJ 








NT 




NT 




NO 


NO 


EK 








NT 




NT 




NO 
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STRUCTURE 


Fi 


i 


i 




65 




1 


c 


□ 


EM 




NT 






NT 




NT 




NO 


NO 


EN 






• 


- 


NT 




NT 


- 


NO 




EO 






































NO 


























EQ 
















NO 


ND 
























ER 


















NO 




ES 


o 
















NO 


NO 


ET 


X. 




- 




• 




• 


- 


NO 


























EU 














- 


NO 


NO 
























EV 
















- 


NO 


NO 
























EW 
















NO 




EX 




•** 














NO 


NO 


EY 


















NO 






. 




















EZ 


















NO 




FA 


















NO 




FB 


















NO 








NT 




NT 




NT 











130 



WO 2004/041209 PCT/US2003/035205 



TABLE 5 



ID 
FD 


STRUCTURE fl 
/^<V 2 




1= 


1 


m 


1 


C 

1 


c 

1 


D 

1 


FE 


T T 


NTT • 


NT 


- 


NT 


• 


- 


- 


ND 


FG 


r 


MT ** 


NT 




NT 








NO 


FH 
















ND 




Fl 
















NO 


ND 


FJ 




- - 


- 


- 


- 


• 




ND 




FK 


































ND 




a 


o 


- • 


- 


• 


- 


- 


- 


ND 


- 


FM 
















ND 




FN 






• 


- 


- 


NT 


- 


ND 


• 


FO 






















T - 


















FP 






































FQ 






































FR 








NT 




NT 








FS 




r • 


NT 




NT 








ND 


FT 


A 


NT 












ND 
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■ 


i 


1 
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■ 


FV 
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FW 




- 


• - 




• - 


NO NO 


FX 
























NO - 


FY 




- 






• - 


NO ~ 


F2 
























NO - 


GA 




* 








NO NO 


GB 










• «• 


ND • 


GC 
























ND - 


GD 












NO NO 


GE 










* - 


NO ND 


GF 












NO ND 


GG 
























NO * 


GH 












NO NO 


Gi 












ND - 


GJ 












ND NO 
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am 
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00 
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ND 


Um 


GL 






































NO 


NO 


orVJ 


















ND 


NO 


GN 


















ND 


ND 
























GO 


















ND 




GP 


X 


NT 




NT 




NT 










GQ 




NT 




NT 




NT 










GR 




NT 




NT 




NT 








ND 


GS 


.Ax. 


NT 




NT 




NT 




- 


- 


ND 


GT 




NT 




NT 




NT 








NO 


GU 




NT 




NT 




NT 








ND 


GV 




<- 














ND 


ND 
























GW 


















NO 




GX 


















ND 




GY 






































ND 
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GZ 


















ND 






? 














- 






HA 


V 
















ND 


























HB 
















ND 




HC 


rVr 












NT 




to 




HD 




* 






* 


*• 


*** 


*• 


ND 


* 




>^ 




















HE 










































HF 




















NO 
























HG 










































HH 


N — 










































HI 












































HJ 


NT 


















nrv 




NT 




NT 




NT 










HL 




NT 


Ml 


NT 




NT 




M 


„ 


























HM 






NT 












HD 




HN 






NT 












NO 


NO 
























HO 




NT 












ND 


ND 


HP 


J04 


• 


NT 










NO 
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NT 




NT 










HR 
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NT 


• 


NT 




NT 


• 


- 


- 




HS 










NT 




NT 




KO 




HT 


T 








NT 




NT 




ND 




HU 


















ND 




HV 


<^&^ 
















NO 




HW 


















ND 




HX 


o 
















ND 


























HY 
















ND 
























T 


HZ 
















NO 






o 




















IA 


















ND 




IB 


















ND 


ND 


IC 


















ND 


ND 
























10 
















ND 
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0 CV im 
• ND - 
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• NO ND 


IG 












• ND NO 








IH 












• NO • 


11 






M M « 






• NO - 


U 












• ND " 


IK 




- * * 


♦ * - 




* NO NO 


IL 






- ■ « 




* ND - 


IM 




& . 








* ND *■ 












IN 












• ND NO 










IO 










" ND ND 








IP 












* ND NO 


IQ 










- ND ND 


IR 












- ND - 
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1 


l 


D 




B 


B 


ND 




IT 
II 


o 


































HI 


















NO 


NO 












IV 


*txP 
















ND 


NO 














IW 


















ND 


NO 












IX 


°^ 


- 




• 










NO 




IY 




















NO 




















tz 




- 












- 


NO 


• 












JA 




















ND 


• 


















JB 




— 


M 












NO 


ND 


JD 




















ND 














JE 


■\x>o 


NT 




NT 




NT 










JF 




NT 




NT 




NT 






• 




JG 




NT 




NT 




NT 








NO 


JH 




NT 




NT 




NT 










Jl 


< 


NT 




NT 




NT 
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* KkA *m 


4 tat ft* 


1 u c 


> CV Lm 

NO - 


KV 










NO ' 


KW 










' NO NO 








KX 












• NO ND 










KY 










1 ND *• 








KZ 












LA 










- ND ND 


LB 






- • 






* ND ND 


LC 


cPCpo 








- ND ND 








ID 










* ND * 


LE 












- NO ND 


LF 


Wo 








- ND NO 








LG 










" ND * 








LH 




— •* 


• * 


• * 


- ND • 


LI 










*• NO • 


U 




n • t 


rr - r 


IT • 




LK 




4T * f 


it • i 


IT • 
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NT 


• N 


« tad 

lis 
r • 


torn 
NT 




) cv 
- 


LM 


Ik, 

6 


NT 


•* N 


T *~ 


NT 






LN 


> 


••, 


M * 


* * 






• ND 


LO 














• ND 














































* ND 


LR 
LS 






M * 








- ND 


LT 














- ND 


LU 












NT 


- ND 




oo 














LV 














LW 








IT # 








LX 








IT ' 


• 




- - 


















LY 
































L2 
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MmA 

"SI 


tanc* 




Lmm 




Rat 


CO 


cv 




MA 


O 


• 


• 


- 


NT 


• 


NT 


- 


• 


• 


MB 










NT 




NT 






ND 
























MC 






















MD 




NT 




NT 




NT 


- 


- 


- 


• 


ME 




NT 




NT 




NT 










MF 




NT 


*** 


NT 




NT 








ND 


MG 




NT 


• 


NT 


- 


NT 


- 




- 


• 


MH 




NT 




NT 




NT 










Ml 




• 


NT 












ND 


- 


MJ 






NT 












NO 




MK 




- 


NT 


- 


- 








ND 


ND 


ML 




* 


NT 












ND 




MM 






NT 










- 


ND 


ND 


MN 


6 


NT 


• 


NT 


- 


NT 


• 


- 


• 


ND 


MO 




NT 




NT 




NT 










MP 


0^ 


NT 




NT 




NT 
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not 






M 


DIM 




CD 


cv 


Vm 


MQ 




HT 




NT 




NT 




- 




NO 


MR 














NT 




NO 




MS 




NT 


• 


• 


NT 


• 


NT 


- 


NO 


NO 


MT 




• 


• 


- 


NT 


• 


NT 


- 


NO 


NO 


MU 




- 




- 




- 


• 


- 


NO 


• 


MV 


















NO 


- 
























MX 
















NO 


NO 


MY 






















MZ 


io-o 
















NO 




MA 


















NO 


























NB 
















NO 




NC 


'Xv ~ 
















NO 


























NO 
















NO 


NO 




Vrvh 




















NE 
















NO 




NF 


















NO 




NG 




- 








M 




- 


NO 


m 


NH 


















NO 


ND 




6q-CK 




















Nl 
















NO 


NO 
























NJ 
















NO 


ND 
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ID 

NK 


STRUCTURE H 

rOU . 1 


3& 
■■ 


HI 


=te3 
■■ 


us 
H 


ai 
■1 


3D 

M 














HU NU 


ML 


°\ 










- » 


40 * 


NM 




























40 " 


NN 


o6ir 










- f 


4D ' 


Kin 




























ID * 


NP 


/ft 










- r 


10 NO 


NQ 














O NO 




-Ax 














NR 












D ND 


















NS 














10 NO 


















NT 












D NO 


















NU 












D ND 


INV 


A, 










" N 


D NO 


NW 












" N 


D * 


















NX 














D - 


NY 












~ M 


o • 


NZ 












- M 


0 NO 
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ID 

OA 


STRUCTURE JSSS. iea JSi. JHl i» 

A - mm 


OB 




OC 




00 


jP<^ 


OE 




OF 


A-K) NT • NT • NT - - * ND 




0■-<V^-• J C^ nt • nt • nt • - • - 


OH 


fVlA^ NT • NT • NT * * - 


01 


\ JL / ur • MT ~ MT " ** " ND 


OJ 




OK 


Cy^~CfZ NT - NT - NT " - "ND 


OL 


a 

/~yJ^f\ NT • NT " NT ~ ~ ~ * 


OM 


0-*^YY*> NT • NT ** NT *** ** * ND 


ON 




OO 


x \o ■ • 


OP 
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ID 

PC 


STRUCTURE 

-o-o 


MarA 

jssh. 

NT 


MM 
Um 


tax) 

WW 

NT 


Um* 


mm 

jam 

NT 


M 


00 


cv 




PO 






NT 












NO 


ND 


PD 






NT 












NO 




PE 






NT 












NO 


NO 


PF 






NT 












ND 




PG 




• 


NT 












NO 


NO 


PH 




• 


NT 












ND 


ND 


PI 


•A 




NT 












ND 


ND 


PJ 






NT 












ND 




PK 




NT 




NT 




NT 










PL 




NT 


• 


NT 


- 


NT 


• 


- 


- 




PM 




NT 


• 


NT 




NT 


• 




• 


- 


PN 




NT 




NT 




NT 










PO 




NT 




NT 




NT 








ND 
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- • 


Ml t«S 1 

**, Uaa. X 

• nt 


tab ft* 

• NT 


BD CV tm 
- ND NO 


PQ 




- - 


** NT 


- NT 


" NO NO 


PR 






• wr 


• NT 


" NO ND 


PS 






- NT 


• NT 


m ND ~ 


PT 










- ND ND 


PU 










- NO - 


rV 










* ND ** 


PW 










* ND ' 


PX 


u 








• ND - 


PY 






• * - 




• ND - 


PZ 


o 








• ND - 


OA 










• ND - 


QB 


^>^~ • 








ND * 


QC 


sj&^ - 








NO ND 
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QE 



OF 
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QH 
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QJ 
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QL 
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QO 
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OR 
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-Y-Y--TO 
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00 


NO 


NO 


QU 




















• 


















NO 






7 u 




















QV 
















NO 




QW 


















NO 


NO 
























QX 
















NO 


NO 




kj T 




















QY 
















NO 


NO 
























QZ 
















NO 


ND 
























RA 
















NO 


- 
























RB 
















NO 


• 
























RC 
















NO 


NO 


RD 




NT 




NT 




NT 


— 


- 




ND 


RE 




NT 


# 


NT 




NT 








NO 


RF 




NT 




NT 




NT 










RG 


O 


NT 




NT 




NT 










RH 




NT 




NT 




NT 










R) 


0 1^ 


NT 




NT 




NT 








NO 


RJ 




NT 




NT 




NT 










RK 




NT 




NT 




NT 










RL 




NT 




NT 




NT 








NO 


RM 




NT 




NT 




NT 








NO 
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NT 
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NT 


tod 
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NT 


Urn 


00 


ev 




RO 




NT 




NT 




NT 










RP 




NT 


- 


NT 


- 


NT 




- 


• 


- 


RQ 




NT 


- 


NT 




NT 


• 


- 


• 


- 


RS 


















NO 


























RT 




















RU 






- 












ND 




RV 














• 




NO 




RW 


















NO 




RX 


















NO 


ND 


RY 


c 
















NO 


























RZ 
















ND 






f 




















SA 


















ND 
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00 


cv 
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SB 
















NO 




SC 


















NO 




SO 
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NO 




SE 


0 
















NO 


NO 


SF 


















ND 


ND 


SG 




• 










NT 




NO 






X 




















SH 














NT 




ND 


ND 


SI 








- 




• 


NT 


- 


ND 


ND 


SJ 


9, 


















ND 
























SK 




















ND 
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Tl 


9. 
















ND 




TJ 
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ND 




TK 
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ND 


ND 


TL 


















ND 


ND 


TM 


















ND 
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ND 


ND 
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ND 




TP 






• 


• 


• 




- 


- 


NO 


NO 


TQ 






• 


• 


- 






- 


ND 


• 


TR 




- 


• 


• 


- 




- 


- 


ND 


ND 


TS 


o 
















ND 


ND 
























TT 


6 
















ND 


ND 
























TU 
















NO 




TV 
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ND 




TW 


















ND 




TX 


















ND 


























TY 
















NO 


ND 



156- 



WO 2004/041209 



PCT/US2003/035205 



TABLES 



ID 

TZ 


STRUCTURE 


— 




• 


• 


— 


* 


CO 
** 


NO 


1mm 

ND 


UA 


Y 






• 


• 
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ND 


ND 
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NT 




NT 


• 


NT 
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• 


• 


UC 




NT 




NT 


- 


NT 




- 


- 


- 


UD 


y ■ 


NT 




NT 




NT 




- 


- 


- 


UE 


















ND 




UF 




• 


- 


- 


- 


- 


• 




ND 






°& 




















UG 


v 
















ND 




UH 


















ND 


- 


U) 




- 




** 


• 


- 


- 


- 


NO 


• 


UJ 


A- 
















ND 


ND 
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ND 
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CLAIMS 



1 . A method for reducing antibiotic resistance of a microbial cell, comprising 
contacting said cell with a transcription factor modulating compound of the 
5 formula (Va): 




(Va) 
wherein 

10 R 1 is OH, OCOCO2H, or a substituted or unsubstituted straight or branched Q- 

C5 alkyloxy group; 

R 2 is H, C02(Ci-C5 substituted or unsubstituted, straight or branched alkyl), or a 

substituted or unsubstituted aryl group; and 
R 4 , R 5 , R 6 , and R 7 are independently selected from the group consisting of H, 
15 (C1-C5 substituted or unsubstituted, straight or branched alkyl), 

C02(Ci-C 5 substituted or unsubstituted, straight or branched alkyl), 

CO(Ci-Cs substituted or unsubstituted, straight or branched alkyl), 

CO(substituted or unsubstituted aryl or heteroaryl), 

CO(C3-C6 substituted or unsubstituted cycloalkyl), 
20 0(Ci-C 5 substituted or unsubstituted, straight or branched alkyl), 

C(NOH)(Ci-C5 substituted or unsubstituted, straight or branched alkyl), 

substituted or unsubstituted amino, C0 2 H, CN, N0 2 , CONH 2 , (CO)(NHOH), 

and halogen; 

provided that when R 6 is N0 2 and R 2 is unsubstituted phenyl, then R 1 is not 
25 0(CHCH3)(C0 2 )CH 2 CH3 or OCH 2 C02H; 

provided that when R 6 is H or N0 2 , then R 1 is not a phenyl-substituted alkyloxy 
group; 

provided that when R 4 , R 5 , R 6 , and R 7 are all H and R 2 is /;ara-methoxyphenyl, 
then R 1 is not OH; and 

30 provided that when R 4 , R 5 , R 6 , and R 7 are all H and R 2 is unsubstituted phenyl, 

then R 1 is not OCH 2 C0 2 CH 2 CH3; 
such that the antibiotic resistance of said cell is reduced. 
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2. A method for modulating a transcription, comprising contacting a transcription 
factor with a transcription factor modulating compound of the formula (Va): 



R l is OH, OCOCO2H, or a substituted or unsubstituted straight or branched Cp 
C5 alkyloxy group; 

R 2 is H, C0 2 (Ci-C 5 substituted or unsubstituted, straight or branched alkyl), or a 
substituted or unsubstituted aryl group; and 

R 4 , R 5 , R 6 , and R 7 are independently selected from the group consisting of H, 
(C1-C5 substituted or unsubstituted, straight or branched alkyl), 
C02(CrCs substituted or unsubstituted, straight or branched alkyl), 
CO(Ci-C5 substituted or unsubstituted, straight or branched alkyl), 
CO(substituted or unsubstituted aryl or heteroaryl), 
CO(C3-C6 substituted or unsubstituted cycloalkyl), 
0(Ci-Cs substituted or unsubstituted, straight or branched alkyl), 
C(NOH)(Ci-C5 substituted or unsubstituted, straight or branched alkyl), 
substituted or unsubstituted amino, CO2H, CN, N0 2 , CONH 2 , (CO)(NHOH), 
and halogen; 

provided that when R 6 is NO2 and R 2 is unsubstituted phenyl, then R l is not 

0(CHCH 3 )(C02)CH 2 CH3 or OCH 2 C0 2 H; 
provided that when R 6 is H or N0 2 , then R l is not a phenyl-substituted alkyloxy 

group; 

provided that when R 4 , R 5 , R 6 , and R 7 are all H and R 2 is /?ara-methoxyphenyl, 

then R 1 is not OH; and 
provided that when R 4 , R 5 , R 6 , and R 7 are all H and R 2 is unsubstituted phenyl, 

then R 1 is not 0CH 2 CO2CH 2 CH3; 
such that the transcription is modulated. 

3. The method of claim 1 or 2, wherein R 4 , R 5 , and R 7 are all H. 



5 




wherein 
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4. The method of claim 1 or 2, wherein R 1 is selected from the group consisting of 
OH, 0(CR'R")i-3H, OCCR'R'VsOH, 0(CR'R")i- 3 C02H, 
0(CR'R")i-3C0 2 (CR'R")i-3H,0(CR , R")i-3(CO)NH2, 
©(CR'R'Oi-sCCNrDNHa, OCOC0 2 H, 0(CR'R")i-3S03H, 

5 0(CR'R")i- 3 OS0 3 H, 0(CR'R")i- 3 P0 3 H, 0(CR'R")i_ 3 OP0 3 H, 

0(CR'R")i- 3 N[(CR'R'V3H]2, 0(CR'R")i- 3 (CO)(NHOH), 
and 0(CR'R")i-3(heteroaryl); 

wherein R' and R" are each independently H, a C1-C3 alkyl, C2-C 3 alkenyl, or 
C2-C3 alkynyl group. 

10 

5 . The method of claim 4, wherein each R' and R" is independently H or CH 3 . 

6. The method of claim 4, wherein R 1 is 0(CR'R")i-3(heteroaryl), and wherein said 
heteroaryl group is a pyrrolyl, furanyl, thiophenyl, thiazolyl, isothiaozolyl, 

1 5 imidazolyl, triazolyl, tetrazolyl, pyrazolyl, oxazolyl, isooxazolyl, pyridinyl, 

pyrazinyl, pyridazinyl, or pyrimidinyl group. 

7. The method of claim 1 or 2, wherein R 2 is a substituted or unsubstituted phenyl, 
pyrrolyl, furanyl, thiophenyl, thiazolyl, isothiaozolyl, imidazolyl, triazolyl, 

20 tetrazolyl, pyrazolyl, oxazolyl, isooxazolyl, pyridinyl, pyrazinyl, pyridazinyl, or 

pyrimidinyl group. 

8. The method of claim 3, wherein R 6 is H, (CR'R")i-3H, (CR'R")i- 3 OH, 
(CR'R")i- 3 NH2, (NOH)(CR'R")i_ 3 H, CO(CR'R'V 3 NH 2 , 

25 CO(CR'R")i- 3 H, CO(CR'R")i- 3 OH, CO(CR'R")o-3CF 3 , 

(CR'R")o-3N[(CR'R")o-3H]2, CO(substituted or unsubstituted heteroaryl), 
CO(C 3 -C 6 substituted or unsubstituted cycloalkyl), 0(CR'R")i-3H, 
CO(substituted or unsubstituted phenyl), C0 2 (CR'R")o-3H, CN, 
N0 2) F,Cl,Br,orI, 

30 wherein R' and R" are each independently H, a Ci-C 3 alkyl, C2-C 3 alkenyl, or 

C2-C 3 alkynyl group. 

9. The method of claim 8, wherein each R' and R" is independently H or CH 3 . 



35 10. The method of claim 8, wherein R 6 is CO(substituted or unsubstituted 

heteroaryl), and wherein said heteroaryl group is. a pyrrolyl, furanyl, thiophenyl, 
thiazolyl, isothiaozolyl, imidazolyl, triazolyl, tetrazolyl, pyrazolyl, oxazolyl, 
isooxazolyl, pyridinyl, pyrazinyl, pyridazinyl, or pyrimidinyl group. 
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1 1 . The method of claim 1 or 2, wherein said transcription factor is a helix-turn-helix 
protein. 

12. The method of claim 1 or 2, wherein said transcription factor is a transcriptional 
5 activation factor. 

13. The method of claims 12, wherein said transcriptional activation factor is an 
AraC family polypeptide. 

10 14. The method of claim 12, wherein said transcriptional activation factor is a Mar A 
family polypeptide. 

1 5 . The method of claim 1 or 2, wherein said transcription factor modulating 
compound is a transcription factor inhibiting compound. 

15 

16. The method of claim 1 or 2, wherein said transcription factor is prokaiyotic. 

17. The method of claim 14, wherein said MarA family polypeptide is MarA, SoxS, 
or Rob. 

20 

18. The method of claim 1, wherein said microbial cell is selected from the group 
consisting of Pseudomonas aeruginosa, Pseudomonas fluorescens, Pseudomonas 
acidovorans, Pseudomonas alcaligenes, Pseudomonas putida, 
Stenotrophomonas maltophilia, Burkholderia cepacia, Aeromonas hydrophilia, 

25 Escherichia coli, Citrobacter freundii, Salmonella typhimurium, Salmonella 

typhi, Salmonella paratyphi, Salmonella enteritidis, Shigella dysenteriae, 
Shigella Jlexneri, Shigella sonnei, Enterobacter cloacae, Enterobacter 
aerogenes, Klebsiella pneumoniae, Klebsiella oxytoca 9 Serratia marcescens, 
Francisella tularensis, Morganella morganii, Proteus mirabilis, Proteus 

30 vulgaris, Providencia alcalifaciens, Providencia rettgeri, Providencia stuartii, 

Acinetobacter calcoaceticus, Acinetobacter haemolyticus, Yersinia 
enterocolitica, Yersinia pestis, Yersinia pseudotuberculosis, Yersinia intermedia, 
Bordetella pertussis, Bordetella parapertussis, Bordetella bronchiseptica, 
Haemophilus influenzae, Haemophilus parainfluenzae, Haemophilus 

35 haemolyticus, Haemophilus parahaemolyticus, Haemophilus ducreyi, 

Pasteurella multocida, Pasteurella haemolytica 9 Branhamella catarrhalis, 
Helicobacter pylori, Campylobacter fetus, Campylobacter jejuni, Campylobacter 
coli, Borrelia burgdorferi, Vibrio cholerae, Yibrio parahaemolyticus, Legionella 
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pneumophila, Listeria monocytogenes , Neisseria gonorrhoeae, Neisseria 
meningitidis, Gardnerella vaginalis, Bacteroides fragilis, Bacteroides distasonis, 
Bacteroides 3452A homology group, Bacteroides vulgatus, Bacteroides ovalus, 
Bacteroides thetaiotaomicron, Bacteroides uniformis, Bacteroides eggerthii, 

5 Bacteroides splanchnicus, Clostridium difficile, Mycobacterium tuberculosis, 

Mycobacterium avium, Mycobacterium intracellulare, Mycobacterium leprae, 
Corynebacterium diphtheriae, Corynebacterium ulcerans, Streptococcus 
pneumoniae, Streptococcus agalactiae, Streptococcus pyogenes, Enterococcus 
faecalis, Enterococcus faecium, Staphylococcus aureus, Staphylococcus 

10 epidermidis, Staphylococcus saprophytics, Staphylococcus intermedins, 

Staphylococcus hyicus subsp. hyicus, Staphylococcus haemolyticus, 
Staphylococcus hominis, and Staphylococcus saccharolyticus. 



19. The method of claim 1 or 2, wherein said compound is a compound of Table 6 or 
15 Table 7. 



20. A pharmaceutical composition comprising a pharmaceutically acceptable carrier 
and a transcription factor modulating compound, wherein said compound is of 
the formula (Va): 



20 







R 






A 


Ft 5 "" 








R 4 





(Va) 



wherein 

R 1 is OH, OCOC0 2 H, or a substituted straight or branched C1-C5 alkyloxy group, 
25 provided that R 1 is not a 2-amino-substituted ethoxy group or a substituted or 

unsubstituted benzyloxy group; 
R 2 is H, C02(Ci-Cs substituted or unsubstituted, straight or branched alkyl), or a 
substituted or unsubstituted aryl group, provided that said aryl group is not a 
thiazolyl or isothiazolyl group; and 
30 R 4 , R 5 , R 6 , and R 7 are independently selected from the group consisting of H, 

(C1-C5 substituted or unsubstituted, straight or branched alkyl), 
CC>2(Ci-C5 substituted or unsubstituted, straight or branched alkyl), 
CO(Ci-C5 substituted or unsubstituted, straight or branched alkyl), 
CO(substituted or unsubstituted aryl or heteroaryl), 
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CO(C3-C6 substituted or unsubstituted cycloalkyl), 
0(Ci-Cs substituted or unsubstituted, straight or branched alkyl), 
C(NOH)(Ci-C5 substituted or unsubstituted, straight or branched alkyl), 
substituted or unsubstituted amino, CO2H, CN, N0 2 , CONH 2 , (CO)(NHOH), 
5 and halogen; 

provided that when R 6 is NO2 and R 2 is unsubstituted phenyl, then R 1 is not 
0(CHCH 3 )(C02)CH 2 CH3 or OCH 2 C0 2 H; 

provided that when R 6 is H or N0 2 , then R l is not a phenyl-substituted alkyloxy 
group; 

1 0 provided that when R 4 , R 5 , R 6 , and R 7 are all H and R 2 is />ara-methoxyphenyl, 

then R 1 is not OH; and 
provided that when R 4 , R 5 , R 6 , and R 7 are all H and R 2 is unsubstituted phenyl, or 

when R 4 , R 5 , and R 7 are all H, R 6 is CI, and R 2 is jpara-methyl-phenyl, then 

R l is not OCH 2 C0 2 CH 2 CH 3 ; 
15 or pharmaceutical^ acceptable salts thereof. 

21 . The pharmaceutical composition of claim 20, wherein R 4 , R 5 , and R 7 are all H. 

22. The pharmaceutical composition of claim 20, wherein R 1 is selected from the 
20 group consisting of OH, 0(CR'R")i-3H, 0(CR'R")i-30H, 0(CR'R")i-3C0 2 H, 

OCCR'R'^^aCOzCCR'R'Oi-aH, 0(CR'R")i- 3 (CO)NH 2 , 
OCCR'R^i-aCCNH)^, OCOC0 2 H, 0(CR'R")i- 3 S03H, 
0(CR'R")i-30S0 3 H, 0(CR'R")i- 3 P0 3 H, 0(CR'R")i-30P03H, 
0(CR , R")wN[(CR , R'V3H] 2 , O(CR'R'0i- 3 (CO)(NHOH), 
25 and 0(CR'R")i-3(heteroaryl); 

wherein R 5 and R" are each independently H, a C\-C$ alkyl, C2-C3 alkenyl, or 
C2-C3 alkynyl group. 

23 . The pharmaceutical composition of claim 22, wherein each R' and R" is 
30 independently H or CH3. 

24. The pharmaceutical composition of claim 22, wherein R 1 is 0(CR'R")i- 
3 (heteroaryl), and wherein said heteroaryl group is a pyrrolyl, furanyl, thiophenyl, 
thiazolyl, isothiaozolyl, imidazolyl, triazolyl, tetrazolyl, pyrazolyl, oxazolyl, 

35 isooxazolyl, pyridinyl, pyrazinyl, pyridazinyl, or pyrimidinyl group. 
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25. The pharmaceutical composition of claim 20, wherein R 2 is a substituted or 
unsubstituted phenyl, pyrrolyl, furanyl, thiophenyl, imidazolyl, triazolyl, 
tetrazolyl, pyrazolyl, oxazolyl, isooxazolyl, pyridinyl, pyrazinyl, pyridazinyl, or 
pyrimidinyl group. 

5 

26. The pharmaceutical composition of claim 20, wherein R 6 is H, (CR'R")i-3H, 
(CR'R'Oi-sOH, (CR'R")i-3NH 2 , (NOH)(CR'R")i-3H, CO(CR'R")o-3NH 2 , 
CO(CR'R")i-3H, COCCR'R'Oi-aOH, CO(CR'R")o-3CF 3 , 
(CR , R")o-.3N[(CR , R")o-3H] 2 , CO(substituted or unsubstituted heteroaryl), 

10 CO(C 3 -C 6 substituted or unsubstituted cycloalkyl), 0(CR'R")i-3H, 

CO(substituted or unsubstituted phenyl), C0 2 (CR , R")o-3H, CN, 
N0 2 , F, CI, Br, or I, 

wherein R' and R" are each independently H, a C1-C3 alkyl, C2-C3 alkenyl, or 
C2-C3 alkynyl group. 

15 

27. The pharmaceutical composition of claim 26, wherein each R' and R" is 
independently H or CH3. 

28. The pharmaceutical composition of claim 27, wherein R 6 is CO(substituted or 
20 unsubstituted heteroaryl), wherein said heteroaryl group is a pyrrolyl, furanyl, 

thiophenyl, thiazolyl, isothiaozolyl, imidazolyl, triazolyl, tetrazolyl, pyrazolyl, 
oxazolyl, isooxazolyl, pyridinyl, pyrazinyl, pyridazinyl, or pyrimidinyl group. 



29. The pharmaceutical composition of claim 20, further comprising an antibiotic. 

25 

30. A pharmaceutical composition of claim 20, wherein said effective amount is 
effective to treat a biofilm associated state in said subject. 



3 1 . The pharmaceutical composition of claim 30, wherein said biofilm associated 
30 state is selected from the group consisting of middle ear infections, cystic 

fibrosis, osteomyelitis, acne, dental cavities, endocarditis, and prostatitis. 



32. The pharmaceutical composition of claim 20, wherein said compound is a 
compound of Table 6 or Table 7. 



35 
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33. A method of inhibiting a biofilm, comprising administering a composition 
comprising a transcription factor modulating compound of the formula 























R 4 





(Va) 

5 wherein 

R l is OH, OCOC0 2 H, or a substituted or unsubstituted straight or branched Ci- 
C 5 alkyloxy group; 

R 2 is H, C0 2 (Ci-C 5 substituted or unsubstituted, straight or branched alkyl), or a 
substituted or unsubstituted aryl group; and 

10 R 4 , R 5 , R 6 , and R 7 are independently selected fiom the group consisting of H, 

(C1-C5 substituted or unsubstituted, straight or branched alkyl), 
CC>2(Ci-C5 substituted or unsubstituted, straight or branched alkyl), 
CO(Ci-C5 substituted or unsubstituted, straight or branched alkyl), 
CO(substituted or unsubstituted aryl or heteroaryl), 

1 5 CO(C3-C6 substituted or unsubstituted cycloalkyl), 

0(Ci-C5 substituted or unsubstituted, straight or branched alkyl), 
C(NOH)(Ci-C5 substituted or unsubstituted, straight or branched alkyl), 
substituted or unsubstituted amino, C0 2 H, CN, N0 2 , CONH 2 , (CO)(NHOH), 
and halogen; 

20 provided that when R 6 is NO2 and R 2 is unsubstituted phenyl, then R l is not 

0(CHCH 3 )(C0 2 )CH 2 CH 3 or OCH 2 C0 2 H; 
provided that when R 6 is H or N0 2 , then R 1 is not a phenyl-substituted alkyloxy 
group; 

provided that when R 4 , R 5 , R 6 , and R 7 are all H and R 2 is ^ara-methoxyphenyl, 
25 then R 1 is not OH; and 

provided that when R 4 , R 5 , R 6 , and R 7 are all H and R 2 is unsubstituted phenyl, 

then R 1 is not OCH 2 C0 2 CH 2 CH 3 ; 
such that said biofilm is inhibited. 

30 34. The method of claim 33, wherein said transcription factor modulating compound 
is a compound of Table 6 or Table 7. 



35. 



The method of claim 33, wherein said composition further comprises a 
surfactant. 
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36. 



5 

37. 
38. 

10 



15 



20 



25 



30 



The method of claim 35, wherein said surfactant is Sodium Dodecyl Sulfate; 
Quaternary Ammonium Compounds; alkyl pyridinium iodides; Tween 80, Tween 
85, Triton X 100; Brij 56; biological surfactants; Rhamnolipid, Surfactin, 
Visconsin, or sulfonates. 

The method of claim 36, wherein said biofilm development is diminished by the 
administration of said composition. 

A method of inhibiting the formation of a biofilm, comprising administering a 
transcription factor modulating compound of the formula (Va): 



R 1 is OH, OCOCO2H, or a substituted or unsubstituted straight or branched C r 
C 5 alkyloxy group; 

R 2 is H, C0 2 (Ci-C 5 substituted or unsubstituted, straight or branched alkyl), or a 
substituted or unsubstituted aryl group; and 

R 4 , R 5 , R 6 , and R 7 are independently selected from the group consisting of H, 
(C1-C5 substituted or unsubstituted, straight or branched alkyl), 
C0 2 (Ci-C5 substituted or unsubstituted, straight or branched alkyl), 
CO(Ci-C 5 substituted or unsubstituted, straight or branched alkyl), 
CO(substituted or unsubstituted aryl or heteroaryl), 
CO(C3-C6 substituted or unsubstituted cycloalkyl), 
0(C r C 5 substituted or unsubstituted, straight or branched alkyl), 
C(NOH)(Ci-C 5 substituted or unsubstituted, straight or branched alkyl), 
substituted or unsubstituted amino, C0 2 H, CN, N0 2 , CONH 2 , (CO)(NHOH), 
and halogen; 

provided that when R 6 is NC^ and R 2 is unsubstituted phenyl, then R 1 is not 

0(CHCH3)(C0 2 )CH 2 CH 3 or OCH 2 C0 2 H; 
provided that when R 6 is H or N0 2 , then R l is not a phenyl-substituted alkyloxy 

group; 

provided that when R 4 , R 5 , R 6 , and R 7 are all H and R 2 is /rara-metiioxyphenyl, 
then R l is not OH; and 
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provided that when R 4 , R 5 , R 6 , and R 7 are all H and R 2 is unsubstituted phenyl, 

then R 1 is not OCH2CO2CH2CH3; 
such that the formation of said biofilm is inhibited. 

39. A method for cleaning and disinfecting contact lenses comprising administering a 
composition comprising an acceptable carrier and a transcription factor 
modulating compound of the formula (Va): 




(Va) 



wherein 

R 1 is OH, OCOCO2H, or a substituted or unsubstituted straight or branched Ci- 
C 5 alkyloxy group; 

R 2 is H, C0 2 (Ci-Cs substituted or unsubstituted, straight or branched alkyl), or a 
substituted or unsubstituted aryl group; and 

R 4 , R 5 , R 6 , and R 7 are independently selected from the group consisting of H, 
(C1-C5 substituted or unsubstituted, straight or branched alkyl), 
C02(Ci-Cs substituted or unsubstituted, straight or branched alkyl), 
CO(Ci-C 5 substituted or unsubstituted, straight or branched alkyl), 
CO(substituted or unsubstituted aryl or heteroaryl), 
CO(C3-C6 substituted or unsubstituted cycloalkyl), 
0(Ci-Cs substituted or unsubstituted, straight or branched alkyl), 
C(NOH)(Ci-C 5 substituted or unsubstituted, straight or branched alkyl), 
substituted or unsubstituted amino, C0 2 H, CN, N0 2 , CONH 2 , (CO)(NHOH), 
and halogen; 

provided that when R 6 is N0 2 and R 2 is unsubstituted phenyl, then R l is not 

0(CHCH 3 )(C02)CH 2 CH 3 or OCH 2 C0 2 H; 
provided that when R 6 is H or N0 2 , then R 1 is not a phenyl-substituted alkyloxy 

group; 

provided that when R 4 , R 5 , R 6 , and R 7 are all H and R 2 is pa ra-methoxyphenyl, 

then R 1 is not OH; and 
provided that when R 4 , R 5 , R 6 , and R 7 are all H and R 2 is unsubstituted phenyl, 

then R 1 is not OCH2CO2CH2CH3 ; 
such that said contact lenses are cleaned and disinfected. 
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40. A method of treating medical indwelling devices comprising administering a 
composition comprising a transcription factor modulating compound of the 
formula (Va): 



30 




(Va) 
wherein 

R 1 is OH, OCOCO2H, or a substituted or unsubstituted straight or branched Ci- 
C5 alkyloxy group; 

R 2 is H, C0 2 (Ci-C 5 substituted or unsubstituted, straight or branched alkyl), or a 
1 0 substituted or unsubstituted aryl group; and 

R 4 , R 5 , R 6 , and R 7 are independently selected from the group consisting of H, 
(C1-C5 substituted or unsubstituted, straight or branched alkyl), 
C0 2 (Ci-C 5 substituted or unsubstituted, straight or branched alkyl), 
CO(Ci-C 5 substituted or unsubstituted, straight or branched alkyl), 
15 CO(substituted or unsubstituted aryl or heteroaryl), 

CO(C 3 -C 6 substituted or unsubstituted cycloalkyl), 
0(Ci-C 5 substituted or unsubstituted, straight or branched alkyl), 
C(NOH)(Ci-C 5 substituted or unsubstituted, straight or branched alkyl), 
substituted or unsubstituted amino, C0 2 H, CN, N0 2 , CONH 2 , (CO)(NHOH), 
20 and halogen; 

provided that when R 6 is NCfe and R 2 is unsubstituted phenyl, then R 1 is not 

0(CHCH 3 )(C0 2 )CH 2 CH 3 or OCH 2 C0 2 H; 
provided that when R 6 is H or N0 2 , then R 1 is not a phenyl-substituted alkyloxy 
group; 

25 provided that when R 4 , R 5 , R 6 , and R 7 are all H and R 2 ispara-methoxyphenyl, 

thenR 1 is not OH; and 
provided that when R 4 , R 5 , R 6 , and R 7 are all H and R 2 is unsubstituted phenyl, 

then R 1 is not OC^CX^OfeCHB; 
such that said medical indwelling devices are treated. 



41 . The method of claim 40, wherein said device is selected from the group 
consisting of catheters, orthopedic devices and implants. 
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A method for treating or preventing a biofilm associated state in asubject, 
comprising administering to said subject an effective amount of a transcription 
factor modulating compound of the formula (Va): 



R 1 is OH, OCOCO2H, or a substituted or unsubstituted straight or branched Q- 
C5 alkyloxy group; 

R 2 is H, C02(Ci-C5 substituted or unsubstituted, straight or branched alkyl), or a 
substituted or unsubstituted aryl group; and 

R 4 , R 5 , R 6 , and R 7 are independently selected from the group consisting of H, 
(C1-C5 substituted or unsubstituted, straight or branched alkyl), 
C0 2 (Ci-C 5 substituted or unsubstituted, straight or branched alkyl), 
CO(Ci-C 5 substituted or unsubstituted, straight or branched alkyl), 
CO(substituted or unsubstituted aryl or heteroaryl), 
CO(C 3 -C6 substituted or unsubstituted cycloalkyl), 
0(Ci-C 5 substituted or unsubstituted, straight or branched alkyl), 
C(NOH)(Ci-C5 substituted or unsubstituted, straight or branched alkyl), 
substituted or unsubstituted amino, C0 2 H, CN, N0 2 , CONH 2 , (CO)(NHOH), 
and halogen; 

provided that when R 6 is N0 2 and R 2 is unsubstituted phenyl, then R l is not 

0(CHCH 3 )(C0 2 )CH 2 CH 3 or OCH 2 C0 2 H; 
provided that when R 6 is H or N0 2 , then R 1 is not a phenyl-substituted alkyloxy 

group; 

provided that when R 4 , R 5 , R 6 , and R 7 are all H and R 2 is /?ara-methoxyphenyl, 

then R 1 is not OH; and 
provided that when R 4 , R 5 , R 6 , and R 7 are all H and R 2 is unsubstituted phenyl, 

then R l is not OCH 2 C0 2 CH 2 CH 3 ; 
such that said biofilm associated state in said subject is treated. 

The method of claim 42, wherein said biofilm associated state is selected from 
the group consisting of middle ear infections, cystic fibrosis, osteomyelitis, acne, 
dental cavities, endocarditis, and prostatitis. 




(Va) 



wherein 
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44. The method of claim 42, further comprising administering a pharmaceutical 
acceptable carrier. 

45. The method of claim 42, wherein said subject is a mammal. 

5 

46. The method of claim 42, wherein said mammal is a human. . 

47. The method of claim 42, wherein said subject is immunocompromised. 

10 48. A method for preventing a bacterial associated state in a subject, comprising 
administering to said subject an effective amount of a transcription factor 
modulating compound of the formula (Va): 




/ 

(Va) 

15 wherein 

R 1 is OH, OCOC0 2 H, or a substituted or unsubstituted straight or branched Q- 
C 5 alkyloxy group; 

R 2 is H, C0 2 (Ci-C 5 substituted or unsubstituted, straight or branched alkyl), or a 
substituted or unsubstituted aryl group; and 

20 R 4 , R 5 , R 6 , and R 7 are independently selected from the group consisting of H, 

(C1-C5 substituted or unsubstituted, straight or branched alkyl), 
C0 2 (Ci-C5 substituted or unsubstituted, straight or branched alkyl), 
CO(Ci-C5 substituted or unsubstituted, straight or branched alkyl), 
CO(substituted or unsubstituted aryl or heteroaryl), 

25 CO(C3-C6 substituted or unsubstituted cycloalkyl), 

0(Ci-Cs substituted or unsubstituted, straight or branched alkyl), 
C(NOH)(Ci-C5 substituted or unsubstituted, straight or branched alkyl), 
substituted or unsubstituted amino, C0 2 H, CN, NO2, CONH 2 , (CO)(NHOH), 
and halogen; 

30 provided that when R 6 is N0 2 and R 2 is unsubstituted phenyl, then R 1 is not 

0(CHCH 3 )(C0 2 )CH 2 CH 3 or OCH 2 C0 2 H; 
provided that when R 6 is H or N0 2 , then R l is not a phenyl-substituted alkyloxy 
group; 
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provided that when R 4 , R 5 , R 6 , and R 7 are all H and R 2 is /?ara-methoxyphenyl, 

thenR 1 is not OH; and 
provided that when R 4 , R 5 , R 6 , and R 7 are all H and R 2 is unsubstituted phenyl, 

then R 1 is not OCH2CO2CH2CH3; 
5 such that the bacterial associated state in said subject is prevented. 



49. The method of claim 48, wherein said subject is a human. 

50. The method of claim 48, wherein said transcription factor modulating compound 
10 is a MarA family polypeptide inhibitor. 

5 1 . The method of claim 48, wherein said transcription factor modulating compound 
is a AraC family polypeptide inhibitor. 

15 52. A method for reducing antibiotic resistance of a microbial cell, comprising 

contacting said cell with a transcription factor modulating compound of Table 8, 
such that the antibiotic resistance of said cell is reduced. 



53. A method for modulating a transcription, comprising contacting a transcription 
20 factor with a transcription factor modulating compound of Table 8, such that 

transcription is modulated. 

54. A pharmaceutical composition comprising a pharmaceutically acceptable carrier 
and a transcription factor modulating compound, wherein said compound is of 

25 Table 8. 



55. A method of inhibiting the formation of a biofilm, comprising administering a 
transcription factor modulating compound of Table 8, such that a biofilm is 
inhibited. 

30 

56. A method for preventing a bacterial associated state in a subject, comprising 
administering to said subject an effective amount of a transcription factor 
modulating compound of Table 8. such that a bacterial associated state is 
prevented. 

35 
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57. A method for treating or preventing a biofilm associated state in a subject, 
comprising administering to said subject an effective amount of a transcription 
factor modulating compound of Table 8, such that said biofilm associated state 
treated or prevented. 

5 

58. A compound of Table 6, Table 7, or Table 8. 
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Multiple sequence alignment of AraC-XylS family members. 



AARP_PROST/ 22-120 
ADAJECOLI/85-183 
ADAjvlYCTU/ 87-185 
ADA_SALTY/94-183 
ADAA_BACSU/ 102-200 
ADIY_ECOLl/ 14 9-246 
AGGR_ECOL I / 164-261 
APPY_ECOLl/l39-236 
ARAC_CITFR/ 180-279 
ARACJECOLI/180-279 
ARAC_ERWCH/ 186-284 
ARAC_SALTY /180-279 
ARAL_STRAT/ 2 02 - 3 0 0 
ARAL_STRLl/202-300 
CAFR_YERPE/ 8-107 
CELD JSCOL 1/168-274 
CFADJBCOLI / 164-261 
CSVR JECOL 1/166-263 
ENVY_ECOLI / 14 9 - 2 4 6 
EUTR_ECOLl/243-344 
EUTR_SALTY/ 24 3 - 344 
BXSA_PSEAE/ 171-269 
FAPR_ECOLI /154-251 
FEAR_ECOI»I / 1 9 9 - 2 9 9 
GADX_EC027/l45-242 
GADX_EC057/l45-242 
GADX_ECOLl/l45-242 
GLXA_RHIME/223-321 
HRPB_RALSO/375-477 
4 INVF_SALTY/112-210 
LACR_STAX Y/ 174-272 
LCRF_YERPE /167-265 
LUMQ_PHOLE/ 148-246 
MARA_ECOLl/l4-112 
MARA_SALTY/ 14 - 1 12 
MEIiR_ECOLI/ 194 -2 92 
MMS R_P S E AE / 201-299 
MSMR_STRMU/ 176-274 
MXIE_SHIFL/99-199 
MXIEJSHISO/ 99-199 
ORUR_PSEAE/241-338 
PCHR_PSEAE/201-296 
PERA_EC027/l68-265 
POCR_SALTY/ 195-293 
PQRA_PROVU/ 7-107 
RAFR_PEDPE/l76-274 
RAMA_ENTCL/ 9-107 
RAMA_KLEPN/ 9-107 
RHAR_E COL I / 2 0 9 - 3 0 7 
RHARJSALTY/179-277 
RHAS_ECOLl/l74-272 



SEIIiVWIEGNLTNR . . 
DKITHACRLLEQETp . 
ARAMRLI ADGTVDR . . 

LEQEt. 

DLITEYIDKNFTEK. . 
DSVYQIiESDIHKD . . 
DKVRNTi EKDL SKR . 



. . LSIaDDXAQHSGYTKWHLQRVFRKIVGMPLGEYXRRI^r 
• • VTLEALMQVAMSPFHliHRL^ 
. .DGVSGLAAQLGYT^ 
. pVTJOAFLAQAVAMSP 

. . LTI^SLMICHGSPYHMHRTFKKIKGITLVEYIQQvB^ 
. .WNLSMVMCLCLjSPSL^ 
. WTIiAI I ADEFNVjSE I Tjl RKRL E S EYI - TFNQX LMQ S 



CKITGIISFNIERQ WHIiKDi AELI YTSESLIKKRLRDEGT -SfTEIIiI 




RDACQYJSDHLADSn FDXASy^QHVCLSPSRLSHLFRQQLGISVLSWREDQ gX 

REACQYXSDHLADSn FD X ASVAQHVCL S P SR!l SHL!i|RQQLGXS VL SWREDQ g£ 

ieacqfetsnlage LRXDEyAJlHVCLSP SRIAHLFREQVGIUI lrwredc 

RDACQYXSDHL ADSh FDI ASVJ^QHVCL SP SRL SHLFRQQLGIgVIi SWREDC 

ASALTFliHRDPAHS WTyAELASAAAV^RST^AAR^ 

ATALTCIiHRDPARS WTjVADLADTAAVSRSTLAA^ 

NSIIQYJEENLESKf IOTDCLVL YS GF SRRYLQI SF5KEYVGMP IGTYXR^ 

DDVP QWliKSTVEKMhdk eqf s e S AI/ENI^AIjS AKSQE^TRATQRYYGKTPMQX-INE X|| 

DKVRNVIEKDL SRK WTLGIXADAFNV^EITiRICRIiESENT -NFNQILMQI 

DKVRGVXEKDLSRK WT^IXiUWFNvis^ 

DSVCRIIQSDIQHY WNIiRIj^A S SLCL SP SLjkKKKLKNENT - SYSQiVjTEC 

SRAREYVLENMSEP VTVLDIiCNQIiHV^SRRTLQNi^HAI LGX GPNAWLKR3 

SRAREY^LENMSEP LT^DLCNQLHVSRRTiiQNAFHAILGIGPNAWLKRl 

ERLQLFMEKHYLNE VTKIiSDF SREFGMGLTTFKELFG S VYGV§PRAV7TSI 

ERIVTLljFSDLTRK Wk|*SDXAEEMHI£EI SVRKRLEQECL -NFNC 

QKWTL^DDNIREE i LRPEWIAGETGMSVRS JYRMFADKGL -WAG t 

TRVCTVXNNNIAHE WTL ARjl ASELLM^P SLLKKKLREEGT -SSYSQLLTEC 

TRVCTVXNNNIAHE WTJArSaSELLMSPS^ 

TRVCTVXNNNIAHE WTXtARXASELLM^P SI^rKKKCiREEET -SYSQLLTEC 
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QHAVDFINTNYQKH IT$feDjV^^^ 

ERLQKFMEENYLQG WKfcSKFAREFGMGLTT|^ 

VLIDNYXEQHLQKK X SVAELS S VAFLAQS Q^ALFKSQMGITPHQY 

HSILDWlEDNLESP LSLEKVSERSGYSKWmQRMFjKKETGHSLGQ^^SI 

HSILDWiEDNLESP LS^EKVSERSGYgKWHgQRI^KICETGHSLGQlxRSI 

SQMLGFIAENYDQA LT^OTVAEHVKI^ANYAMGIPQRVMQL'^ 

DGLHAYMREHLHAR LE!fcER£i^CNL9KF^^ 

NQVKKXXHSQYGSS IiRVND^AKKLN^ 

YHLVLYtiLRTIEKEk eVRIKSLTEHYGvjsEAYFRSLCRKALGAKVKEQI 

YHLVL YliLRT I EKEk eVRXKSLTEHYGVSKAYFRSLCIlKALGAKVKEQi 

TRVRRLLLARPGDF PdLeQAARELHTs"gRs!dRRh£sSLGT-^QQ^ 

HAARDLliVGALQEP PSLDTLA'SRVGMOTRKLTAGFRKVFGASVFGYLQE 

DRVXKVXELDX SKN WKLGDV^S S SMFMSDSCJj.RKQIjNKENIj - TFKKIMLI 

KKALRYXDAHLSDD LRIiEDyASHVYLs|PYYFS!^ 

NDILKV^ETQLQRNe glKiDTXANKSGYSKWB^QRIFKDFKGCTLGE 

NLAVSYjLQENYSTG CTIMDLCHYIjnjSR 

DTIVEwbDDNLHQP LR&EDXAjRHAGYSK^ 

DTIVEWiDDNLHQP LRXDDtARHAGYSKVmQRLFLQYKGEgLGRYIRER gL 

DKLITRLAASLKSP Fi^DKFCTEASCSERVLRQQFRQQTGMTINQYLRQ\ 

DKLITAIANSLECP FM^AFCQQEQCSERV^RQQFRAQTGM^XNQ ( YIAQ\ 

NLLLAWLEDHFADE VNiTOAW^QF SL§LRTJ*HRQ^QQTGLOTQRyi " " 
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TCPN_VIBCH/ 17 2 -2 6 9 
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THCR_RHOER/ 227-328 
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URERJPROMI / 17 1 - 2 6 8 
VIRF_SHIDY/ 161-258 
VTRF__YEREN/ 167-265 
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XYLR_ECOLI/288-386 
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XYLS_PSEPU/ 214-315 
XYSl_PSEPU/2 14 - 3 15 
XYS2_PSEPU/ 39-140 
XYS3_PSEPU/214-315 
XYS4_PSEPU/ 214-315 
Y4 FK_RHI SN/ 3 1 8 - 4 1 7 
YA52_HAEIN/l94-295 
YBBBJBACSU/ 166-264 
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YEAMJSCOLI/158-258 
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YI JO_ECOLI / 172-270 
YI S R_BACSU/ 183-281 
YKGA_EC0LI/19-117 
YKGD_ECOLl/ 177 -2 7 8 
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YPDC_ECOI*I / 1 84 - 2 8 2 
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ASIKMRVEQNLANGs FSiTDVAEAERITPRJ&S^ 

DKVRNLXEKDLSRK WTLGIIMAFNAslsiTIRKRpESEOT -NFN^IIiMQL^ 
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ERLQKFMEENYLQG. WK&SXFAREFGMGLTT^^ 
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ARAL_STRAT/202-300 
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TEAKRLLSSTND. . • KMGVXAE 1 
CRjAAILViRLTAK • . .SMLDXALSL: 
Q I AAQMF SRETL . . . p-^VVXAE 
QRALRLARAGV - . . .PFAETATL, 
CIiQKYHL, . . S^HEV^QRi 
XIHDGM. . .KASAAlftMR 




[SDTLRQ 
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Multiple sequence alignment of PROSITE PS00041, HTH_AraC family 1. 



AARP_PROST/72 - 114 
ADA_MYCTU/137 - 179 
AD AA__B ACSU/ 152-194 
ADIY_ECOLI/198-240 
AGGR_ECOLl/2 13 - 2 5 5 
APPYJ3COLI/188-230 
ARAC_CITFR/ 231-273 
ARAC^E CO L I / 2 3 1 - 2 7 3 
ARAC_ERWCH/236-278 
ARACJSALTY/231-273 
ARALJ3TRAT/252-294 
ARAL_STRLl/252 -294 
CELD_ECOLI/226-268 
CFAD_ECOLl/211-255 
CSVR_ECOLl/215-257 
ENVY_ECOLI/ 1 9 8 - 24 0 
EUTR_ECOLl/ 2 93 - 3 3 8 
EUTR_SALTY/ 2 93 - 3 3 8 
EXSA_PSEAE/ 221-263 
FAPR_ECOLI/ 2 03-245 
GADX_EC027/l94-236 
GADX_EC057/l94-236 
GADX_ECOLI/ 1 94 -2 3 6 
GLXA_RH IME / 2 7 3 - 3 1 5 
HRPB_RALSO/ 4 2 6 - 4 7 1 
INVF_SALTY/ 162-204 
LACR_S TAX Y/ 224-266 
LCRF_YERPE/ 2 17-259 
LUMQ_PHOLE/ 198-240 
MARAJECOL 1/64-106 
MARA_SALTY/ 64-106 
MELR_ECOL 1/244-286 
MMSRJPSEAE/251-293 
MSMRJSTRMU/ 2 2 6 - 2 6 8 
MXIE~SHIFL/15 1-193 
MXIE_SHIS0/151-193 
PCHR_PSEAE/ 2 5 1 - 2 92 
PERA_EC02 7 /2 1 7 - 2 5 9 
POCR_SALTY/ 245-287 
PQRA_PROVU/ 59-101 
RAFRJPEDPE/226-268 
RAMA_K3jEPN/ 59-101 
RHARJECOLI/ 2 5 9 - 3 0 1 
RHAR_SALTY/ 2 2 9 - 2 7 1 
RHAS_ECOLI/224-266 
RHASJS ALTY/ 2 2 4 - 2 6 6 
RNS_ECOLI/211-255 
SOXS_ECOLI/57-99 
SOXS_SALTY/ 57-99 
TCPN_VIBCH/ 221-263 
THCR_RHOER/ 2 7 7 - 3 2 2 
UIiERJBCOLI/220-262 
URER_PROMI/220-262 
VIRF^SHID Y/ 210-252 
VIRF_YEREN/ 2 17-259 



jnstsyfiic 
jddqly |srv 
jddqly |srv JckctGa 
Jddqly |srv fRkrviGvi 
JddqlyQsrv |Kkct|GaS 
jgs esaLsva SKrvl 




^ceAAkeliqtt. . . nl^idiAlkyQgdsqqsgakrgKiaylGi 
|KqtJ^vlfcett • . .nlpFgdvAfaa^ssirq|ndtVRlac r 
gVhaAKkyjLiqt . . . nkaXgdIAicvglanapy fi tl 
§MryAVne3tomd. . .gknXsq^Sqsc^nstsy |isv 
^skAiAlllildn. . . sygXsqJSnmi " as tsy flirl 
^Mry^Kkll tsn . . • sysInvVAqkc 

l^sqAKlljEiSt t rinpXatyGJrirv 

^Csq^KllltStt. . . rmpXatVGjrm; 
lyjirAKllIjqtt. . . qes:3panIGrw 
|Xsq£KllIiStt. . • rmpllatyGjrnv 

lljelTArqLreg. . . saptiaaXAhe-v^^ „ 

fxelAArqLreg. . .natiiasIAhsv^gsesaLsva jpCrvlj 
^Xnf AKkqliemt, . .nysVtdl^f ea^sspslQikt JfekltSf 
QirmSKaafcllle . nsyqXsq^Siimiglssasy |irv 

IskAAllLlen. . . syqSjsq^Simiiglssasy |iri |Nkhf GvjiflR 
IryAiVqmLlmd. . . nknX t orVAal cj^s s t sy |i s v pja f ydl\ 
^naVRre^ispwsasmtvikdA^av^ f g 

^LnaVRr eL i sp ws qs a t^dAAmqw^whl gq |a t d jfoql f Ae^ 
^JlyAlHqllflns . . . dms^dXAmeajcfflssgsy f tqs §krrf Gc5^ 
pnqi^Akf jrirs . . . dhqf ggffijs lvffitsvsv |ikt pCeyyj^^ 
|MqrALqliviy. . . gvsXkrV^vsc^hsvsy f iyv *~ - 
"^qrALqlJyih. . .gf sXkr^vsc^hsvsy |iyv ! _ 
MqrjALqllyih. . . gf BlkrVAvgcffihs vsy jiyv jknyyGm' 
fcrhj^Rrllxqqs . . .plsIpeXAyat^sspah§sna JCrlf Sq*. 
KeglRscttildsernpsnSidTAsrw^rsrsaLvkg^RkqfNei 
^MaqSLlnSveg. . -henitq^vnhfesspshHsselpeli^ - 
j^hjASqlfciht. . . stl^sdiSrqv^kdpll j skngrkhfEiSA 
J^lytiiHqlLlng. . . kmsXvdl&ne affl ssqsy ' tqs gRrrfGcf" 
^Ldl^KqlTaer , . . qkpfls q^Aa! c°s s qs s |sq 
^telAqkLkes . . .nepXlyX*Aery^esqqtLtrt 
g^telAqkkkes . . . nepXlyjLAer 

P'-nhVRalLsdt. . .dksJTldjraLlt 
eygcqlftdss. . .dqs^ar^§qa 
gM[krSQylIienp. . .klsjaexlsns' 
pivnGLldyjflh. . • nqtXtsASjran: 
ijLvnGLldVflh. . .nqtXtsAJ&a 
^reAHrmTrfcde . . .eanj^styAy* 

^hA'sif lirtt. . .dknXdelSclvgTOBtByHijtvgiseyywc^ 
^MvsARelLchs . • .dwsIasXirnl^sqtsygckv|RqtyQv^l 
JUleAAksLqek. . . dmsXldia!lmv^ssqat§tri fiCkhf Ntj3 
JijedAKqrLsts. . .nnsVqs^Anmvgkdsf t|ska ^rys.Ga's 
^llAArdl^rdt. • . dqrVyd^Clky^dsqqtBtrv JPrtf Nqi 
^chAQylilqhs . . . rllSsdXStecgedsnygsvv^retjGmii 
jlxchAQylLqhs . . .plmXseXSmqc^edsnygsvv gTretiarfTj 
3l^mk^RhljLrhs . . . easytdlAyrcgsdsnhgstlaRref^S 
5liikARhlIirhs . . . dhsVjteX^rc|^dsnhgstl3krefNw 
QLrmgKaajLllle » nsyqIsqJSnmi§Issasy§iri pnchy§v3 
^llAAvejLrtt. . . eroXf dXAmdl^vsqqtgsrv pxqf Dr^j 
ff[ill!AAveIirtt. • .erpXfditodlgvsqqtgsrvprefDi^l 
^sySXslKktg. . . ef k^kqilAyqe^aevsy§s tv ^CstmNvAj 
^qk^kdilradpasegVtelAqr^^lhvg^age^qtfGvS 
pgmALnybtfs. . .nysVf qishrc^gsnaygcdv^rkykmij 
ppiALny^itfs. . . nysVf qi;Shrcpgsnay|cda^rky^T[ 
|MhhAAklIilns . . .qsy^ndVSrli^sspsygirk^eyy^iM 
a^y^P^ 1 ^ 111 ^* • *lonsIvdX&ea^ssqsyatqs^rrf^c3 



jesqqtLtr 
Jrsssrgys 
Sddsyygsrl 
|sdsli 
Srsts] 




IsneXiftrl^f^A 



|as tsh fsnelfetrlGf SA 
|sp . ahgsia^kr^i^ 1 " 
as t sygikv pjeyyNt* 



Figure 2A 

5/12 



WO 2004/041209 



PCT/US2003/035205 



X YLR_BCOIiI / 3 3 8 - 3 8 0 
XYLR_HAEIN/33 8-380 
XYLS_PSEPU/ 264-309 
X YS 1 J?S E PU/ 264-309 
XYS2JPSEPU/ 89-134 
XYS3_PSEPU/ 264-309 
XYS4_PSEPU/ 264-309 
Y4FK_RHISN/ 368-411 
YA52_HAEIN/247-289 
YBBB~BACSU/ 216-258 
YCGK^ALTCA/ 117 - 157 
YDEO_BCOLI/l86-227 
YEAM_E COL I / 2 1 1 - 2 5 2 
YFI F_BACSU/ 242-283 
YHI W_ECOLI / 1 8 8 - 2 3 0 
YIDL_ECOIil/247-289 
YI JO J3COLI / 222-264 
YI SR_BACSU/ 2 3 3 - 2 7 5 
YMCR_S TRLA / 2 3 4 - 2 7 5 
YPDC_ECOLI/234-276 
ADA_ECOLI / 8 6 - 12 8 
ADA_ECOLl/ 13 6 - 177 
ADA_SALTY/ 86-128 
ADAJSALTY/ 136-177 
CAFR_YERPE/ 56-101 
ROB_ECOLl/55-100 
TETD_ECOLI / 78-123 



ekARslXiist. . > tlsineXSqmcggpslqy{|ysvgkkayDt'] 
glsrAKnlJj'qgt — di sXkeXTe i c pfe s i ctvBvh vffiyka f Rml 
^esIRaclLndpsanvrsItel^ldy^^hlgr^en^safGei 
^esIRacLndpsanvrsIteliQdy^lhlgrQaen^safdel 
^ecVRacpsnpttnirsljteVAld 

ecIRarIisdpnanvrsVteMAldv^£htgr§aen§Rstf|Ge: 
i'ecIRarLsdpnanvrsVtel^dvf^ 

iaaAHgalkagr * . agsp;tetiAlnl^snpgr§3vl^say<315 
jjLqsAlAf lj&kqs . . . qqsyialAlev^qseahgckv^nyyQlj 
^tkAKrlMaks — nckUcel Aha t°qde f ygs r iQKky tjgc'i 

lleqAiCkvfllk ksjVteTA!yev^nnsny§a t\ 

^qhAKnltfrve. . • . gsVnkIAeqc|ggas tsy{3iyaBRkh£GnS 

QLimALqg^vkg dtVqkVAJitl^ds t ta§i tm^kgl^c 

^nkAAelljkst. . . nlsfkelA'eeif^sv . hy§trv§SakiGsi 



tf t GyJ 
xtErJS 



adqahLardVRemaGsSL 



smARrlpelr, . . qipLhtXAekc[gjgss tsvflin 
gLdlALslLkqq. . •gns^geVAdtlN§f dsf h§ski 
^ehSKtliikgy . . . dlkVjkeVichai 
2^teAKrl!tisst. . •ndkMgvIAet" 
2^qrALrlArag. . . .vpFaeTAtla 

|WakARmi£qky. . .hlsXheyAqrc^pdsdyQcrvgRrqf Gl^] 
Ijth^Crl^eqet . .pvt^ea^dqvAMsp . £hLhrl^att|(3m(l 
jjArrLResLakg, . . > esVtt^ln a^^ dsssYyxkADetlGgfPA 
§5ac^Cr lltteqe t . . pvttaf iAqavAMsp . f hLhr l|J§as tjfSmS^ 
jjjArrLReafiakg. . . .epXtaMyrag^dsss^rhADqtli^TA 
3^r£sraAallrltrltj^ 



rrLSksAvalrltarpXldlAilgyRgdsgq 
rrLTkaAvelrltkktilelAlkyQidsqqs 



tragRkqfA* 
jtrr§Kyi£K< 
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FIGURE 3B 
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FIGURE 3D 
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SEQUENCE LISTING 
<110> Paratek Pharmaceuticals, Inc. et al. 

<120> TRANSCRIPTION FACTOR MODULATING COMPOUNDS AND METHODS 
OF USE THEREOF 

<130> PAZ-054CP2PC 

<150> 60/423,319 
<151> 2002-11-01 

<150> 60/425,916 
<151> 2002-11-13 

<160> 4 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 7878 
<212> DNA 

<213> Echerichia coli 

<220> 
<221> CDS 

<222> (4124) ... (4843) 
<400> 1 

gttaactgtg gtggttgtca ccgcccatta cacggcatac agctatatcg agccttttgt 60 
acaaaacatt gcgggattca gcgccaactt tgccacggca ttactgttat tactcggtgg 120 
tgcgggcatt attggcagcg tgattttcgg taaactgggt aatcagtatg cgtctgcgtt 180 
ggtgagtacg gcgattgcgc tgttgctggt gtgcctggca ttgctgttac ctgcggcgaa 240 
cagtgaaata cacctcgggg tgctgagtat tttctggggg atcgcgatga tgatcatcgg 300 
gcttggtatg caggttaaag tgctggcgct ggcaccagat gctaccgacg tcgcgatggc 360 
gctattctcc ggcatattta atattggaat cggggcgggt gcgttggtag gtaatcaggt 420 
gagtttgcac tggtcaatgt cgatgattgg ttatgtgggc gcggtgcctg cttttgccgc 480 
gttaatttgg tcaatcatta tatttcgccg ctggccagtg acactcgaag aacagacgca 540 
atagttgaaa ggcccattcg ggcctttttt aatggtacgt tttaatgatt tccaggatgc 600 
cgttaataat aaactgcaca cccatacata ccagcaggaa tcccatcaga cgggagatcg 660 
cttcaatgcc acccttgccc accagccgca taattgcgcc ggagctgcgt aggcttcccc 720 
acaaaataac cgccaccagg aaaaagatca gcggcggcgc aaccatcagt acccaatcag 780 
cgaaggttga actctgacgc actgtggacg ccgagctaat aatcatcgct atggttcccg 840 
gaccggcagt acttggcatt gccagcggca caaaggcaat attggcactg ggttcatctt 900 
ccagctcttc cgacttgctt ttcgcctccg gtgaatcaat cgctttctgt tgcggaaaga 960 
gcatccgaaa accgataaac gcgacgatta agccgcctgc aattcgcaga ccgggaatcg 1020 
aaatgccaaa tgtatccatc accagttgcc cggcgtaata cgccaccatc atgatggcaa 1080 
atacgtacac cgaggccatc aacgactgac gattacgttc ggcactgttc atgttgcctg 1140 
ccaggccaag aaataacgcg acagttgtta atgggttagc taacggcagc aacaccacca 1200 
gccccaggcc aattgcttta aacaaatcta acattggtgg ttgttatcct gtgtatctgg 1260 
gttatcagcg aaaagtataa ggggtaaaca aggataaagt gtcactcttt agctagcctt 1320 
gcatcgcatt gaacaaaact tgaaccgatt tagcaaaacg tggcatcggt caattcattc 1380 
atttgactta tacttgcctg ggcaatatta tcccctgcaa ctaattactt gccagggcaa 1440 
ctaatgtgaa aagtaccagc gatctgttca atgaaattat tccattgggt cgcttaatcc 1500 
atatggttaa tcagaagaaa gatcgcctgc ttaacgagta tctgtctccg ctggatatta 1560 
ccgcggcaca gtttaaggtg ctctgctcta tccgctgcgc ggcgtgtatt actccggttg 1620 
aactgaaaaa ggtattgtcg gtcgacctgg gagcactgac ccgtatgctg gatcgcctgg 1680 
tctgtaaagg ctgggtggaa aggttgccga acccgaatga caagcgcggc gtactggtaa 1740 
aacttaccac cggcggcgcg gcaatatgtg aacaatgcca tcaattagtt ggccaggacc 1800 
tgcaccaaga attaacaaaa aacctgacgg cggacgaagt ggcaacactt gagtatttgc 1860 
ttaagaaagt cctgccgtaa acaaaaaaga ggtatgacga tgtccagacg caatactgac 1920 
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gctattacca ttcatagcat tttggactgg atcgaggaca acctggaatc gccactgtca 1980 
ctggagaaag tgtcagagcg ttcgggttac tccaaatggc acctgcaacg gatgtttaaa 2040 
aaagaaaccg gtcattcatt aggccaatac atccgcagcc gtaagatgac ggaaatcgcg 2100 
caaaagctga aggaaagtaa cgagccgata ctctatctgg cagaacgata tggcttcgag 2160 
tcgcaacaaa ctctgacccg aaccttcaaa aattactttg atgttccgcc gcataaatac 2220 
cggatgacca atatgcaggg cgaatcgcgc tttttacatc cattaaatca ttacaacagc 2280 
tagttgaaaa cgtgacaacg tcactgaggc aatcatgaaa ccactttcat ccgcaatagc 2340 
agctgcgctt attctctttt ccgcgcaggg cgttgcggaa caaaccacgc agccagttgt 2400 
tacttcttgt gccaatgtcg tggttgttcc cccatcgcag gaacacccac cgtttgattt 2460 
aaatcacatg ggtactggca gtgataagtc ggatgcgctc ggcgtgccct attataatca 2520 
acacgctatg tagtttgttc tggccccgac atctcggggc ttattaactt cccaccttta 2580 
ccgctttacg ccaccgcaag ccaaatacat tgatatacag cccggtcata atgagcaccg 2640 
cacctaaaaa ttgcagaccc gttaagcgtt catccaacaa tagtgccgca cttgccagtc 2700 
ctactacggg caccagtaac gataacggtg caacccgcca ggtttcatag cgtcccagta 2760 
acgtccccca gatcccataa ccaacaattg tcgccacaaa cgccagatac atcagagaca 2820 
agatggtggt catatcgata gtaaccagac tgtgaatcat ggttgcggaa ccatcgagaa 2880 
tcagcgaggc aacaaagaag ggaatgattg ggattaaagc gctccagatt accagcgaca 2940 
tcaccgccgg acgcgttgag tgcgacatga tctttttatt gaagatgttg ccacacgccc 3000 
aactaaatgc tgccgccagg gtcaacataa agccgagcat cgccacatgc tgaccgttca 3060 
gactatcttc gattaacacc agtacgccaa aaatcgctaa ggcgatcccc gccaattgtt 3120 
tgccatgcag tcgctccccg aaagtaaacg cgccaagcat gatagtaaaa aacgcctgtg 3180 
cctgtaacac cagcgaagcc agtccagcag gcataccgaa gttaatggca caaaaaagaa 3240 
aagcaaactg cgcaaaactg atggttaatc cataccccag cagcaaattc agtggtactt 3300 
tcggtcgtgc gacaaaaaag atagccggaa aagcgaccag cataaagcgc aaaccggcca 3360 
gcatcagcgt ggcatgttat gaagccccac tttgatgacc acaaaattta gcccccatac 3420 
gaccactacc agtagcgcca acaccccatc ttttcgcgac attctaccgc ctctgaattt 34 80 
catcttttgt aagcaatcaa cttagctgaa tttacttttc tttaacagtt gattcgttag 3540 
tcgccggtta cgacggcatt aatgcgcaaa taagtcgcta tacttcggat ttttgccatg 3600 
ctatttcttt acatctctaa aacaaaacat aacgaaacgc actgccggac agacaaatga 3660 
acttatccct acgacgctct accagcgccc ttcttgcctc gtcgttgtta ttaaccatcg 3720 
gacgcggcgc taccgtgcca tttatgacca tttacttgag tcgccagtac agcctgagtg 3780 
tcgatctaat cggttatgcg atgacaattg cgctcactat tggcgtcgtt tttagcctcg 3840 
gttttggtat cctggcggat aagttcgaca agaaacgcta tatgttactg gcaattaccg 3900 
ccttcgccag cggttttatt gccattactt tagtgaataa cgtgacgctg gttgtgctct 3960 
tttttgccct cattaactgc gcctattctg tttttgctac cgtgctgaaa gcctggtttg 4020 
ccgacaatct ttcgtccacc agcaaaacga aaatcttctc aatcaactac accatgctaa 4080 
acattggctg accatcggtc cgccgctcgg cacgctgttg gta atg cag age ate 4135 

Met Gin Ser lie 
1 

aat ctg ccc ttc tgg ctg gca get ate tgt tec gcg ttt ccc atg ctt 4183 
Asn Leu Pro Phe Trp Leu Ala Ala lie Cys Ser Ala Phe Pro Met Leu 
5 10 .15 20 

ttc att caa att tgg gta aag cgc age gag aaa ate ate gee acg gaa 4231 
Phe lie Gin lie Trp Val Lys Arg Ser Glu Lys He He Ala Thr Glu 
25 30 35 

aca ggc agt gtc tgg teg ccg aaa gtt tta tta caa gat aaa gca ctg 4279 
Thr Gly Ser Val Trp Ser Pro Lys Val Leu Leu Gin Asp Lys Ala Leu 
40 45 50 



ttg tgg ttt ace tgc tct ggt ttt ctg get tct ttt gta age ggc gca 4327 
Leu Trp Phe Thr Cys Ser Gly Phe Leu Ala Ser Phe Val Ser Gly Ala 
55 60 65 
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ttt get tea tgc att tea caa tat gtg atg gtg att get gat ggg gat 4375 
Phe Ala Ser Cys lie Ser Gin Tyr Val Met Val lie Ala Asp Gly Asp 
70 75 80 

ttt gee gaa aag gtg gtc gcg gtt gtt ctt ccg gtg aat get gee atg 4423 
Phe Ala Glu Lys Val Val Ala Val Val Leu Pro Val Asn Ala Ala Met 
85 90 95 100 

gtg gtt acg ttg caa tat tec gtg ggc cgc cga ctt aac ccg get aac 4471 
Val Val Thr Leu Gin Tyr Ser Val Gly Arg Arg Leu Asn Pro Ala Asn 
105 no 115 

ate cgc gcg ctg atg aca gca ggc acc etc tgt ttc gtc ate ggt ctg 4519 
lie Arg Ala Leu Met Thr Ala Gly Thr Leu Cys Phe Val lie Gly Leu 
120 125 130 

gtc ggt ttt att ttt tec ggc aac age ctg eta ttg tgg ggt atg tea 4567 
Val Gly Phe lie Phe Ser Gly Asn Ser Leu Leu Leu Trp Gly Met Ser 
135 140 145 

get gcg gta ttt act gtc ggt gaa ate att tat gcg ccg ggc gag tat 4615 
Ala Ala Val Phe Thr Val Gly Glu lie He Tyr Ala Pro Gly Glu Tyr 
150 155 fc 160 

atg ttg att gac cat att gcg ccg cca gaa atg aaa gee age tat ttt 4663 
Met Leu He Asp His He Ala Pro Pro Glu Met Lys Ala Ser Tyr Phe 
165 170 175 180 

tec gee cag tct tta ggc tgg ctt ggt gee gcg att aac cca tta gtg 4711 
Ser Ala Gin Ser Leu Gly Trp Leu Gly Ala Ala He Asn Pro Leu Val 
185 190 195 

agt ggc gta gtg eta acc age ctg ccg cct tec teg ctg ttt gtc ate 4759 
Ser Gly Val Val Leu Thr Ser Leu Pro Pro Ser Ser Leu Phe Val He 
200 205 210 

tta gcg ttg gtg ate att get gcg tgg gtg ctg atg tta aaa ggg att 4807 
Leu Ala Leu Val He He Ala Ala Trp Val Leu Met Leu Lys Gly He 
215 220 225 

cga gca aga ccg tgg ggg cag ccc gcg ctt tgt' tga tttaagtcga 4853 
Arg Ala Arg Pro Trp Gly Gin Pro Ala Leu Cys * 
230 235 



acacaataaa gatttaattc agecttegtt 
gagatgaaaa ttaaggtaag cgaggaaaca 
tgggtaatat gaatgttttt atggccgtac 
ccgcgtattt cagccacaaa tgggatgact 
gccccttgtt acagttgtgt acaaggggee 
agtaaacegg cggtgaatgc ttgcatggat 
cattttcctg cactgataac gaatcgttga 
taaaeggage ttaaactegg ttaatcacat 
ttccggtttg cttaccctca tacattgccc 
gctcttcagg aaatgegega ctcacacctg 
aatgtgtgat ggcatggtta tcgactaact 
cttcatcatt ageegctttg acaataatga 
tttegtaate acgcgtccaa ctggctaagt 
cgattaaatg ccegtagtat cattaaccaa 
ataaagattc agaggctcag cgttgcgtaa 
acgacccggc aatcccgtca aaacatccat 
gtaatcggtt aatgccgcag taaaagaaag 



taggttacct ctgetaatat ctttctcatt 4913 
caccacacca taaaeggagg caaataatgc 4973 
tgggaataat tttattttct ggttttctgg 5033 
aatgaacgga gataatccct cacctaaccg 5093 
tgatttttat gaeggegaaa aaaaaccgcc 5153 
agatttgtgt tttgetttta cgctaacagg 5213 
cacagtagca tcagttttct caatgaatgt 5273 
tttgttcgtc aataaacatg cagegattte 5333 
ggtccgctct tccaatgacc acatccagag 5393 
ctgtcacggt aatgttgata tgcccttcag 5453 
ggcaaattct gacacctgca egacatgett 5513 
taaattcttc gcccccgtag cgataaaccg 5573 
aagttgccag ggtgcgtaat actacatcgc 5633 
tttaaategg tcaatatcca acaacattaa 5693 
ctgatgatca aaggattcat caagaacccg 5753 
attgetaegg ategtcagea aataaatttt 5813 
caacccctcc tgaaaggcgt egaaatgege 5873 
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gtcctgccag tgattttcaa caatagccag cattaattcc cgaccacagt tatgcatatg 5933 
ttgatgggca gaatccatta gccgaacgta aggtaattca tcgttatcga gtggccccag 5993 
atgatcaatc caccgaccaa actggcacag tccataagaa tggttatccg ttatttctgg 6053 
cttactggca tctctcgcga ccacgctgtg aaacatactc accagccact ggtagtgggc 6113 
atcgatagcc ttattgagat ttaacaagat ggcatcaatt tccgttgtct tcttgatcat 6173 
tgccactcct ttttcacagt tccttgtgcg cgctattcta acgagagaaa agcaaaatta 6233 
cgtcaatatt ttcatagaaa tccgaagtta tgagtcatct ctgagataac attgtgattt 6293 
aaaacaaaat cagcggataa aaaagtgttt aattctgtaa attacctctg cattatcgta 6353 
aataaaagga tgacaaatag cataacccaa taccctaatg gcccagtagt tcaggccatc 6413 
aggctaattt atttttattt ctgcaaatga gtgacccgaa cgacggccgg cgcgcttttc 6473 
ttatccagac tgccactaat gttgatcatc tggtccggct gaacttctcg tccatcaaag 6533 
acggccgcag gaataacgac attaatttca ccgctcttat cgcgaaaaac gtaacggtcc 6593 
tctcctttgt gagaaatcaa attaccgcgt agtgaaaccg aagcgccatc gtgcatggtt 6653 
tttgcgaaat caacggtcat tttttttgca tcatcggttc cgcgatagcc atcttctatt 6713 
gcatgaggcg gcggtggcgc tgcatcctgt tttaaaccgc cctggtcatc tgccaacgca 6773 
taaggcatga caagaaaact tgctaataca atggcctgaa atttcatact aactccttaa 6833 
ttgcgtttgg tttgacttat taagtctggt tgctattttt ataattgcca aataagaata 6893 
ttgccaattg ttataaggca tttaaaatca gccaactagc tgtcaaatat acagagaatt 6953 
taactcacta aagttaagaa gattgaaaag tcttaaacat attttcagaa taatcggatt 7013 
tatatgtttg aaaattatta tattggacga gcatacagaa aaagcaaatc acctttacat 7073 
ataaaagcgt ggacaaaaaa cagtgaacat taatagagat aaaattgtac aacttgtaga 7133 
taccgatact attgaaaacc tgacatccgc gttgagtcaa agacttatcg cggatcaatt 7193 
acgcttaact accgccgaat catgcaccgg cggtaagttg gctagcgccc tgtgtgcagc 7253 
tgaagataca cccaaatttt acggtgcagg ctttgttact ttcaccgatc aggcaaagat 7313 
gaaaatcctc agcgtaagcc agcaatctct tgaacgatat tctgcggtga gtgagaaagt 7373 
ggcagcagaa atggcaaccg gtgccataga gcgtgcggat gctgatgtca gtattgccat 7433 
taccggctac ggcggaccgg agggcggtga agatggtacg ccagcgggta ccgtctggtt 7493 
tgcgtggcat attaaaggcc agaactacac tgcggttatg cattttgctg gcgactgcga 7553 
aacggtatta gctttagcgg tgaggtttgc cctcgcccag ctgctgcaat tactgctata 7613 
accaggctgg cctggcgata tctcaggcca gccattggtg gtgtttatat gttcaagcca 7673 
cgatgttgca gcatcggcat aatcttaggt gccttaccgc gccattgtcg atacaggcgt 7733 
tccagatctt cgctgttacc tctggaaagg atcgcctcgc gaaaacgcag cccattttca 7793 
cgcgttaatc cgccctgctc aacaaaccac tgataaccat catcggccaa catttgcgtc 7853 
cacagataag cgtaataacc tgcag 7878 

<210> 2 
<211> 239 
<212> PRT 

<213> Echerichia coli 
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Thr Val Gly Glu lie lie Tyr Ala 
155 160 
His lie Ala Pro Pro Glu Met Lys 

170 175 
Leu Gly Trp Leu Gly Ala Ala He 
185 190 
Leu Thr Ser Leu Pro Pro Ser Ser 
205 

He He Ala Ala Trp Val Leu Met 
220 

Trp Gly Gin Pro Ala Leu Cys 
235 



<210> 3 
<211> 870 
<212> DNA 

<213> Echerichia coli 

<220> 

<221> CDS 

<222> (1) . . . (870) 

<400> 3 

atg gat cag gec ggc att att cgc gac ctt tta ate tgg ctg gaa ggt 48 
Met Asp Gin Ala Gly He He Arg Asp Leu Leu He Trp Leu Glu Gly 
1 5 10 15 

cat ctg gat cag ccc ctg teg etc gac aat gta gcg gcg aaa gca ggt 96 
His Leu Asp Gin Pro Leu Ser Leu Asp Asn Val Ala Ala Lys Ala Gly 
20 25 30 

tat tec aag tgg cac tta cag aga atg ttt aaa gat gtc act ggc cat 144 
Tyr Ser Lys Trp His Leu Gin Arg Met Phe Lys Asp Val Thr Gly His 
35 40 45 

get att ggc gcg tat att cgt get cgt cgt ttg teg aaa teg gcg gtc 192 
Ala He Gly Ala Tyr He Arg Ala Arg Arg Leu Ser Lys Ser Ala Val 
50 55 60 

gca eta cgc ctg act gcg cgt ccg att ctg gac ate gcg ctg caa tac 240 
Ala Leu Arg Leu Thr Ala Arg Pro He Leu Asp He Ala Leu Gin Tyr 
65 70 75 80 

cgc ttc gac tct caa cag aca ttt acc cgc gca ttc aag aag cag ttt 288 
Arg Phe Asp Ser Gin Gin Thr Phe Thr Arg Ala Phe Lys Lys Gin Phe 
85 90 "* 95 

gee cag act cct gca ctt tac cgc cgt tct cct gaa tgg age gee ttt 336 
Ala Gin Thr Pro Ala Leu Tyr Arg Arg Ser Pro Glu Trp Ser Ala Phe 
100 105 110 

ggt att cgc ccg ccg eta cgc ctg ggt gaa ttc act atg cca gag cac 384 
Gly He Arg Pro Pro Leu Arg Leu Gly Glu Phe Thr Met Pro Glu His 
115 120 125 

aaa ttt gtc acc ctg gaa gat acg ccg ctg att ggt gtt acc cag age 432 
Lys Phe Val Thr Leu Glu Asp Thr Pro Leu He Gly Val Thr Gin Ser 
130 135 140 
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tac tec tgt teg ctg gag caa ate tct gat ttc cgc cat gaa atg cgt 480 
Tyr Ser Cys Ser Leu Glu Gin He Ser Asp Phe Arg His Glu Met Arg 
145 150 155 160 

tat cag ttc tgg cac gat ttt etc ggc aac gcg ccg acc att ccg ccg 528 
Tyr Gin Phe Trp His Asp Phe Leu Gly Asn Ala Pro Thr He Pro Pro 
165 170 175 

gtg etc tac ggc ctg aat gaa acg cgt ccg agt cag gat aaa gac gac 576 
Val Leu Tyr Gly Leu Asn Glu Thr Arg Pro Ser Gin Asp Lys Asp Asp 
180 185 190 

gag caa gag gta ttc tat acc acc gcg tta gee cag gat cag gca gat 624 
Glu Gin Glu Val Phe Tyr Thr Thr Ala Leu Ala Gin Asp Gin Ala Asp 
195 200 205 

ggc tat gta ctg acg ggg cat ccg gtg atg ctg cag ggc ggc gaa tat 672 
Gly Tyr Val Leu Thr Gly His Pro Val Met Leu Gin Gly Gly Glu Tyr 
210 215 220 

gtg atg ttt acc tat gaa ggt ctg gga acc ggc gtg cag gag ttt ate 720 
Val Met Phe Thr Tyr Glu Gly Leu Gly Thr Gly Val Gin Glu Phe He 
225 2 30 235 240 

ctg acg gta tac gga acg tgc atg cca atg etc aac ctg acg cgc cgt 768 
Leu Thr Val Tyr Gly Thr Cys Met Pro Met Leu Asn Leu Thr Arg Arg 
245 250 255 

aaa ggt cag gat att gag cga tac tac ccg gca gaa gat gec aaa gcg 816 
Lys Gly Gin Asp He Glu Arg Tyr Tyr Pro Ala Glu Asp Ala Lys Ala 
260 265 270 

gga gat cgc cca att aat eta cgc tgt gaa ctg ctg att ccg ate cgt 864 
Gly Asp Arg Pro He Asn Leu Arg Cys Glu Leu Leu He Pro He Arg 
275 280 285 

cgt taa 870 
Arg * 



<210> 4 
<211> 289 
<212> PRT 

<213> Echerichia coli 

<400> 4 
Met 

1 S io 15 
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25 30 
Gin Arg Met Phe Lys Asp Val Thr Gl} 

40 45 
Arg Ala Arg Arg Leu Ser Lys Ser Ale 
55 60 
Arg Pro He Leu Asp He Ala Leu Gin 
65 70 75 80 

Thr Arg Ala Phe Lys Lys Gli 
90 95 

Arg Ser Pro Glu Trp Ser Ale 

105 HO 
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